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(54) Tide: MODIFIED PEPTIDES AS THERAPEUTIC AGENTS 



(57) Abstract 



The present invention concerns fusion of Fc domains with biologically active peptides and a process for preparing pharmaceutical 
agents using biologically active peptides. In this invention, pharmacologically active compounds arc prepared by a process comprising: a) 
selecting at least one peptide that modulates the activity of a protein of interest; and b) preparing a pharmacologic agent comprising an Fc 
domain covalently linked to at least one amino acid of the selected peptide. Linkage to the vehicle increases the half-life of the peptide, 
which otherwise would be quickly degraded in vivo. The preferred vehicle is an Fc domain. The peptide is preferably selected by phage 
display, E. coli display, ribosome display, RNA-peptide screening, or chemical-peptide screening. 
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Modified Peptides as Therapeutic Agents 
Background of the Invention 

Recombinant proteins are an emerging class of therapeutic agents. 
5 Such recombinant therapeutics have engendered advances in protein 
formulation and chemical modification. Such modifications can protect 
therapeutic proteins, primarily by blocking their exposure to proteolytic 
enzymes. Protein modifications may also increase the therapeutic 
protein's stability, circulation time, and biological activity. A review 

1 0 article describing protein modification and fusion proteins is Francis 
(1992), Focus on Growth Factors 3:4-10 (Mediscript, London), which is 
hereby incorporated by reference. 

One useful modification is combination with the "Fc" domain of an 
antibody. Antibodies comprise two functionally independent parts, a 

1 5 variable domain known as 'Tab", which binds antigen, and a constant 
domain known as "Fc", which links to such effector functions as 
complement activation and attack by phagocytic cells. An Fc has a long 
serum half-life, whereas an Fab is short-lived. Capon et al. (1989), Nature 
337: 525-31. When constructed together with a therapeutic protein, an Fc 

2 0 domain can provide longer half-lif e or incorporate such functions as Fc 
receptor binding, protein A binding, complement fixation and perhaps 
even placental transfer. Id. Table 1 summarizes use of Fc fusions known in 
the art. 
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Table 1— Fc fusion with therapeutic proteins 



Form of Fc 


Fusion 
partner 


Therapeutic 
implications 


Reference 


IgGi 


N-terminus of 
CD30-L 


Hodgkin's disease; 
anaplastic lymphoma; T- 
cell leukemia 


U.S. Patent No. 
5,480,981 


Murine rcy^a 


1L-10 


anti-inflammatory; 
transplant rejection 


Zheng et al. (1995), J* 
Immunol. 154: 5590-600 




1 111 lyvOptvi 


septic shock 


Fisher fiLaL (1996), 
Engl. J, Med. 334:1697- 
1702; Van Zee, K. fiLaL 
(1996), J. lmmunoU56: 

2221-30 


IgG, IgA, 
IgM, or IgE 
(excluding 
the first 
domain) 


TNF receptor 


inflammation, autoimmune 
disorders 


U.S. Pat. No. 5,808.029, 
issued September 15, 
1998 


lgG1 


CD4 receptor 


AIDS 


Capon fiLaL (1989), 
fslatur** 337: 525-31 


lgG1, 
taG3 


N-terminus 
of IL-2 


anti-cancer, antiviral 


Harvill fiLaL (1995), 
Immunotech. 1 : 95-1 05 


IgGi 


C-terminus of 
OPG 


osteoarthritis; 
bone density 


WO 97/23614, published 
July 3, 1997 


IgGi 


N-terminus of 
leptin 


anti-obesity 


PCT/US 97/23183, filed 
December 11, 1997 


Human Ig 
Cy1 


CTLA-4 


autoimmune disorders 


Linsley (1991). JJExfi, 
Med. 174:561-9 



A much different approach to development of therapeutic agents is 
peptide library screening. The interaction of a protein ligand with its 
5 receptor often takes place at a relatively large interface. However, as 

demonstrated for human growth hormone and its receptor, only a few key 
residues at the interface contribute to most of the binding energy. 
Clackson etal. (1995), Science 267: 383-6. The bulk of the protein ligand 
merely displays the binding epitopes in the right topology or serves 
1 0 functions unrelated to binding. Thus, molecules of only "peptide" length 
(2 to 40 amino acids) can bind to the receptor protein of a given large 
protein ligand. Such peptides may mimic the bioactivity oFtheiarge 
protein ligand ("peptide agonists") or, through competitive binding, 
inhibit the bioactivity of the large protein ligand ("peptide antagonists"). 
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Phage display peptide libraries have emerged as a powerful 
method in identifying such peptide agonists and antagonists. See, for 
example, Scott et al . (1990), Science 249: 386; Devlin etal. (1990), Science 
249: 404; U.S. Pat. No. 5,223,409, issued June 29, 1993; U.S. Pat. No. 
5 5,733,731, issued March 31, 1998; U.S. Pat. No. 5,498,530, issued March 12, 
1996; U.S. Pat. No. 5,432,018, issued July 11, 1995; U.S. Pat. No. 5,338,665, 
issued August 16, 1994; U.S. Pat. No. 5,922,545, issued July 13, 1999; WO 
96/40987, published December 19, 1996; and WO 98/15833, published 
April 16, 1998 (each of which is incorporated by reference). In such 

1 0 libraries, random peptide sequences are displayed by fusion with coat 
proteins of filamentous phage. Typically, the displayed peptides are 
affinity-eluted against an antibody-immobilized extracellular domain of a 
receptor. The retained phages may be enriched by successive rounds of 
affinity purification and repropagation. The best binding peptides may be 

1 5 sequenced to identify key residues within one or more structurally related 
families of peptides. See, e.g., Cwirla etal. (1997), Science 276: 1696-9, in 
which two distinct families were identified. The peptide sequences may 
also suggest which residues may be safely replaced by alanine scanning or 
by mutagenesis at the DNA level. Mutagenesis libraries may be created 

2 0 and screened to further optimize the sequence of the best binders. 
Lowman (1997), Ann. Rev. Biophvs. Biomol. Struct. 26: 401-24. 

Structural analysis of protein-protein interaction may also be used 
to suggest peptides that mimic the binding activity of large protein 
ligands. In such an analysis, the crystal structure may suggest the identity 

2 5 and relative orientation of critical residues of the large protein ligand, 
from which a peptide may be designed. See, e.g., Takasaki et al. (1997), 
Nature Biotech. 15: 1266-70. These analytical methods may^also be used to 
investigate the interaction between a receptor protein and peptides 



3 
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selected by phage display, which may suggest further modification of the 
peptides to increase binding affinity. 

Other methods compete with phage display in peptide research. A 
peptide library can be fused to the carboxyl terminus of the lac repressor 

5 and expressed in E. coli . Another E. coli -based method allows display on 
the cell's outer membrane by fusion with a peptidoglycan-associated 
lipoprotein (PAL). Hereinafter, these and related methods are collectively 
referred to as " E. coli display." In another method, translation of random 
RN A is halted prior to ribosome release, resulting in a library of 

1 0 polypeptides with their associated RNA still attached. Hereinafter, this 
and related methods are collectively referred to as "ribosome display." 
Other methods employ chemical linkage of peptides to RNA; see, for 
example, Roberts & Szostak (1997), Proc. Natl. A cad. Sci. USA, 94: 12297- 
303. Hereinafter, this and related methods are collectively referred to as 

1 5 "RN A-peptide screening." Chemically derived peptide libraries have been 
developed in which peptides are immobilized on stable, non-biological 
materials, such as polyethylene rods or solvent-permeable resins. Another 
chemically derived peptide library uses photolithography to scan peptides 
immobilized on glass slides. Hereinafter, these and related methods are 

2 0 collectively referred to as "chemical-peptide screening." Chemical-peptide 
screening may be advantageous in that it allows use of D-amino acids and 
other unnatural analogues, as well as non-peptide elements. Both 
biological and chemical methods are reviewed in Wells & Lowman (1992), 
Curr. Opin. Biotechnol. 3: 355-62. 

2 5 Conceptually, one may discover peptide mimetics of any protein 

using phage display and the other methods mentioned above. These 
methods have been used for epitope mapping, for identification of critical 
amino acids in protein-protein interactions, and as leads for the discovery 
of new therapeutic agents. E.g., Cortese et al. (1996), Curr. Opin. Biotech. 7: 



- 3 %r^> 
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616-21. Peptide libraries are now being used most often in immunological 
studies, such as epitope mapping. Kreeger (1996), The Scientist 10(13): 19- 
20. 

Of particular interest here is use of peptide libraries and other 
5 techniques in the discovery of pharmacologically active peptides. A 

number of such peptides identified in the art are summarized in Table 2. 
The peptides are described in the listed publications, each of which is 
hereby incorporated by reference. The pharmacologic activity of the 
peptides is described, and in many instances is followed by a shorthand 
1 o term therefor in parentheses. Some of these peptides have been modified 
(e.g., to form C-terminally cross-linked dimers). Typically, peptide 
libraries were screened for binding to a receptor for a pharmacologically 
active protein (e.g., EPO receptor). In at least one instance (CTLA4), the 
peptide library was screened for binding to a monclonal antibody. 
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Table 2- 


Pharmacologically active peptides 


Form of 
peptide 


Binding 
partner/ 
protein of 
interest* 


Pharmacologic 
activity 


Reference 


intrapeptide 
disulfide- 
bonded 


EPO receptor 


c r~ v-j-i 1 1 1 n 1 1* u i* 


Wrighton et al. (1996), 
ScifiOfifi 273: 458-63; 
U.S. Pat. No. 5,773,569, 
issued June 30, 1998 to 
Wriphton et al. 


C-terminally 
cross-linked 
dimer 


EPO receptor 


EPO-mimetic 


Uvnah fiLal. (1996), 
Science 273: 464-71 : 
Wrighton fiLal- (1997), 
Matyr^BiotechnoloavIS: 
1P61-5* International 
patent application WO 
96/40772, published 
Dec. 19, 1996 


linear 


EPO receptor 


EPO-mimetic 


Naranda fiLal. (1999), 
Pmn Natl Acad Sci 
U$A. 96: 7569-74 


linear 


c-MpI 


TPO-mimetic 


Cwirla fiLal-(1997) 
Science 276: 1696-9; 
U.S. Pat. No. 5,869,451, 
issued Feb. 9, 1999; U.S. 
Pat. No. 5,932,946, 
issued Aug. 3. 1999 


C-terminally 
cross-linked 
dimer 


c-MpI 


TPO-mimetic 


Cwirla fiLal. (1997), 

Y^yicIlUO CI D. IOW <7 


disulfide- 
linked dimer 




stimulation of 
hematopoiesis 
( M G-CSF-mimetic B ) 


Paukovits fiLal. (1984), 
Hnppft-Sevlers Z. 
PhuQtnl Ohpm 365' 303- 
1 1 ; Laerum sLal- (1988). 
E*p. JHemat. 16: 274-80 


alkylene- 
linked dimer 




G-CSF-mimetic 


Bhatnagar fiLal. (1996), 
.1 IWlPd. Cham. 39: 3814- 
9; Cuthbertson fiLal* 
/1QQ7\ A Mfid.Chem. 
40: 2876-82; King fiLal- 
(1991) T Exp. Hematol. 
19:481; King fiLal* 
(1995), BlQfid 86 (Suppl. 
1): 309a 


linear 


IL-1 receptor inflammatory and 
autoimmune diseases 
("IL-1 antagonist" or 
u IL-1ra-mimetic") 


U.S. Pat. No. 5,608,035; 
U.S. Pat. No. 5,786.331 ; 
U.S^Pat. No. 5,880,096; . 
Yanofsky fiLal- (1996), 



• The Drotein listed in this column may be bound by the associated peptide (e.g.. EPO 
re^DtoML " receptor) or mimicked by the associated peptide. The references hsted for 
SiJy wSr the molecule is bound by or mimicked by the peptodes. 
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Proc, Natl, Acad. Scl 93: 
7381-6; Akeson fiLal. 
(1996), J. Biol. Chem . 
271:30517-23; 
Wiekzorek flLal. (1997), 
PoL J. Pharmacol. 49: 



107-17; Yanofsky (1996), 
PNAs, 93:7381 -7386. 



linear 


Facteur 
thymique 
serique (FTS) 


stimulation of 
lymphocytes 
("FTS-mimetic") 


Inagaki-Ohara at al. 
(1996), Ceiyar Immunol. 
171:30-40; Yoshida 
(1984), Int. J. 
Immunopharmacol. 
6:141-6. 


intrapeptide 
disulfide 
bonded 


CTLA4 MAb 


CTLA4-mimetic 


Fukumoto et aL (1998), 
Nature Biotech. 16: 267- 
70 


exocyclic 


TNF-a receptor 


TNF-a antagonist 


Takasaki aLal. (1997), 
Nature Biotech, 15:1266- 

70; WO 98/53842, 
published December 3, 
1998 


linear 


TNF-a receptor 


TNF-a antagonist 


Chirinos-Rojas ( ), J* 
f n m.. 5621-5626. 


intrapeptide 
disulfide 
bonded 


C3b 


inhibition of complement 
activation; autoimmune 
diseases 
( M C3b-antaaonisr) 


Sahu fiLai. (1996), J* 
Immunol. 1 57: 884-91; 
Morikis et al. (1998), 
Protein ScL 7: 619-27 


linear 


vinculin 


ceil adhesion processes — 
cell growth, differentiation, 
wound healing, tumor 
metastasis ("vinculin 
binding") 


Adey fiLal. (1997), 
Biochem, J. 324: 523-8 


linear 


C4 binding 
protein (C4BP) 


anti-thrombotic 


Linse fiLal. (1997), J* 
Biol, Chem. 272:14658- 

65 


linear 


urokinase 
receptor 


processes associated with 
urokinase interaction with 

its receptor (e.g., 
angiogenesis, tumor cell 
invasion and metastasis); 
(TIKR antagonist") 


Goodson fiLal. (1 994), 
Proc, Natl. Acad, Sci. 91: 
7129-33; International 
application WO 
97/35969, published 
October 2, 1997 


linear 


Mdm2, Hdm2 


Inhibition of inactivation of 
p53 mediated by Mdm2 or 

hdm2; anti-tumor 
( u Mdm/hdm antagonist") 


Picksley fiLal- (1994), 
Oncogene 9: 2523-9: 
Bottger fiLal. (1997) J, 
Mol. Biol. 269: 744-56; 
Bottger fiLal- (1996), 
Oncogene 13: 2141-7 


linear 


p21 wA * • 


anti-tumor by mimicking 
the activity of p21 WAFt 


BaH fiLal. (1997), Cum , 
Bifil. 7:71-80 


linear 


farnesyl 


anti-cancer by preventing 


Gibbs et al. (1994), Gall 



b FTS is a thymic hormone mimicked by the molecule of this invention rather than a 
receptor bound by the molecule of this invention. 

1 
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linear 



linear 



linear 



linear 



transferase activation of ras oncogene 



Ras effector 
domain 



anti-cancer by inhibiting 
biological function of the 
ras oncogene 



SH2/SH3 
domains 



pl6 



Src, Lyn 



anti-cancer by inhibiting 

tumor growth with 
activated tyrosine kinases 

anti-cancer by mimicking 
activity of p16; e.g., 
inhibiting cyclin D-Cdk 
complex ("pl6-mimetic") 
inhibition of Mast cell 
activation, IgE-related 
conditions, type I 



77:175-178 

Moodie et al. (1994), 

Trends Genet 10: 44-48 

Rodriguez et al. (1994), 
N|at yre 370:527-532 
Pawson et al(1993), 
Hurr. Biol. 3:434-432 
Yu et al. (1994), Cflll 
76:933-945 
Fahraeus fiLal. (1996), 
Curr. Biol . 6:84-91 



Stauffer fiLal. (1997), 
Biochem . 36: 9388-94 



™\\ antaqonist") — 


linear 


Mast cell 
protease 


treatment of inflammatory 
disorders mediated by 
release of tryptase-6 
("Mast cell protease 
inhibitors") 


International application 
WO 98/33812, published 
August 6, 1998 


linear 


SH3 domains 


treatment of SH3- 
mediated disease states 
( M SH3 antagonist") 


Rickles fiLm- (1994), 
£MB£U. 13: 5598-5604; 
Sparks fiLal- (1994), *L 
Rial. Chem. 269: 23853- 
6; Sparks fiLal- (1996), 
Prro? N flt! AMd Sc*- 93: 

1540-4 


linear 


HBV core 
antigen (HBcAg) 


treatment of HBV viral 
infections Canti-HBV) 


Dyson & Muray (1995), 
p r «r. Natl. Acad. Sci. 92: 
2194-8 


linear 


selectins 


neutrophil adhesion; 
inflammatory diseases 
("selectin antagonist") 


Martens fiLal. (1995), J* 
BiOl Chem. 270: 21129- 
36; European patent 
application EP0 714 
912, published June 5, 
1996 


linear, 
cyclized 


calmodulin 


calmodulin antagonist 


Pierce fiLal- (1995). 
Mninr.. Diversity 1:259- 
65; Dedman et al. 
(ium) A. Biol. Chem. 
268: 23025-30; Adey & 
Kay (1996), QfiQfi 169: 
133-4 


linear, 
cyclized- 


integrins 


tumor-homing; treatment 
for conditions related to 
integrin-mediated cellular 
events, including platelet 
aggregation, thrombosis, 
wound healing, 
osteoporosis, tissue 
repair, anqiogenesis (e.g. 


International applications 
WO 95/14714. published 
June" 1, 1995; WO 
97/08203, published 
March 6, 1997; WO 
98/10795, published 
March 19, 1998; WO 
99/24462. published May 
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for treatment of cancer), 
and tumor invasion 
nnteqrin-binding w ) 


20, 1999; Kraft eLal. 
(1999), J- Biol. Chem. 
274: 1979-1985 


cyclic, linear 


fibronectin and 
extracellular 
matrix 
components of T 
cells and 
macrophages 


treatment of inflammatory 
and autoimmune 
conditions 


WO 98/09985, published 
Marcn i^, iyyo 


linear 


somatostatin 
and cortistatin 


treatment or prevention of 
hormone-producing 
tumors, acromegaly, 
giantism, dementia, 
gastric ulcer, tumor 
growth, inhibition of 
hormone secretion, 
modulation of sleep or 
neural activity 


European patent 
application 0 91 1 393, 

nithlichaH Anril OR 1 QQQ 

puuiisnea April ^o, 1 999 


linear 


bacterial 
lipopolysac- 
charide 


antibiotic; septic shock; 
disorders modulatable by 
CAP37 


1 1 0 p~t No SA77 151 

issued March 2, 1999 


linear or 
cyclic, 
including D- 
amino acids 


pardaxin, mellitin 


anttpatnogenic 


\A/n Q7/**1 019 Dublished 
28 August 1997 


linear, cyclic 


VIP 


impotence, 
neurodegenerative 
disorders 


WO 97/40070, published 
October 30, 1997 


linear 


CTLs 


cancer 


Mav 2, 1997 


linear 


THF-gamma2 




Burnstein (1988), 
Biochem.. 27:4066-71. 


linear 


Amylin 




Pnnnor M QR7^ PrnP 
wOOpul \ 1 SO / /, riULi 

Natl Acad Sri 
84:8628-32. 


linear 


Adrenomedullin 




Kitamura M993) BBRC. 
192:553-60. 


cyclic, linear 


VEGF 


anti-angiogenic, cancer, 
rheumatoid arthritis, 

riiaKotir* rotinonathv 
nenriack f H \/EGF 

antagonist") 


1 ail Ui w 1 1 iwi y ■ 

fiiOCliflni., 37:17754- 
17764. 


cyclic 


MMP 


inflammation and 
autoimmune disorders; 
tumor growth 
("MMP inhibitor") 


Koivunen (1999), Natliri"" 
Biotech.. 17:768-774. 




HGH fraament 




U.S. Pat. No. 5,869.452 




Echistatin 


inhibition of platelet 
aggregation 


Can (1988),*L_BioL 
Cfiem.. 263:19827-32. ... 


linear 


SLE 
autoantibody 


SLE 


WO 96/30057, published 
October 3. 1996 




GD1 alpha 


suppression of tumor 
metastasis 


Ishikawa fiLal- (1998), 
FEBS Lett. 441 (1):20-4 




antiohosDholiDid 


findothelial cell activation , 


Blank etal. (1999), Emc 
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beta , 2 - antiphosphotipid Natl A^d, SfiC JJSA 96: 

glycoprotein-l syndrome (APS), 5164-8 

(P2GPI) thromboembolic 
antibodies phenomena, 

thrombocytopenia, and 
recurrent fetal loss 

Peptides identified by peptide library screening have been regarded 
as "leads" in development of therapeutic agents rather than as therapeutic 
agents themselves, like other proteins and peptides, they would be 
5 rapidly removed in vivo either by renal filtration, cellular clearance 

mechanisms in the reticuloendothelial system, or proteolytic degradation. 
Francis (1992), ™ Growth Factors 3: 4-11. As a result, the art 
presently uses the identified peptides to validate drug targets or as 
scaffolds for design of organic compounds that might not have been as 
1 o easily or as quickly identified through chemical library screening. 

Lowman (1997), a™ Rpv Biophvs. Biomol. Struct. 26: 401-24; Kay etal. 
(1998), nnt r Disc. Today 3: 370-8. The art would benefit from a process by 
which such peptides could more readily yield therapeutic agents. 

Summary of the Invention 
15 The present invention concerns a process by which the inviyo naif- 

life of one or more biologically active peptides is increased by fusion with 
a vehicle. In this invention, pharmacologically active compounds are 

prepared by a process comprising: 

a) selecting at least one peptide that modulates the activity of a 

2 o protein of interest; and 

b) preparing a pharmacologic agent comprising at least one 
vehicle covalently linked to at least one amino acid sequence 
of the selected peptide. 

The preferred vehicle is an Fc domain. The peptides screened in step (a) 
2 5 are preferably expressed in a phage display library. The vehicle and the 
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peptide may be linked through the N- or C-terminus of the peptide or the 
vehicle, as described further below. Derivatives of the above compounds 
(described below) are also encompassed by this invention. 

The compounds of this invention may be prepared by standard 
5 synthetic methods, recombinant DNA techniques, or any other methods of 
preparing peptides and fusion proteins. Compounds of this invention that 
encompass non-peptide portions may be synthesized by standard organic 
chemistry reactions, in addition to standard peptide chemistry reactions 
when applicable. 

10 The primary use contemplated is as therapeutic or prophylactic 

agents. The vehicle-linked peptide may have activity comparable to— or 
even greater than— the natural ligand mimicked by the peptide. In 
addition, certain natural ligand-based therapeutic agents might induce 
antibodies against the patient's own endogenous ligand; the vehicle-linked 

1 5 peptide avoids this pitfall by having little or typically no sequence identity 

with the natural ligand. 

Although mostly contemplated as therapeutic agents, compounds 
of this invention may also be useful in screening for such agents. For 
example, one could use an Fc-peptide (e.g., Fc-SH2 domain peptide) in an 
2 0 assay employing anti-Fc coated plates. The vehicle, especially Fc, may 
make insoluble peptides soluble and thus useful in a number of assays. 

The compounds of this invention may be used for therapeutic or 
prophylactic purposes by formulating them with appropriate 
pharmaceutical carrier materials and aclministering an effective amount to 
a patient, such as a human (or other mammal) in need thereof. Other 
related aspects are also included in the instant invention. 

Numerous additional aspects and advantages of the present 
invention will become apparent upon consideration of the figures and 
detailed description of the invention. 

I! 
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Brief Description of the Figures 

Figure 1 shows a schematic representation of an exemplary process 
of the invention. In this preferred process, the vehicle is an Fc domain, 
which is linked to the peptide covalently by expression from a DNA 
5 construct encoding both the Fc domain and the peptide. As noted in 
Figure 1, the Fc domains spontaneously form a dimer in this process. 

Figure 2 shows exemplary Fc dimers that may be derived from an 
IgGl antibody. "Fc" in the figure represents any of the Fc variants within 
the meaning of "Fc domain" herein. "X"' and "X 2 " represent peptides or 

1 o linker-peptide combinations as defined hereinafter. The specific dimers are 

as follows: 

A, D: Single disulfide-bonded dimers. IgGl antibodies typically 
have two disulfide bonds at the hinge region between the constant and 
variable domains. The Fc domain in Figures 2A and 2 D may be formed by 

1 5 truncation between the two disulfide bond sites or by substitution of a 
cysteinyl residue with an unreactive residue (e.g., alanyl). In Figure 2A, 
the Fc domain is linked at the amino terminus of the peptides; in 2D, at the 

carboxyl terminus. 

B, E: Doubly disulfide-bonded dimers. This Fc domain may be 

2 0 formed by truncation of the parent antibody to retain both cysteinyl 

residues in the Fc domain chains or by expression from a construct 
including a sequence encoding such an Fc domain. In Figure 2B, the Fc 
domain is linked at the amino terminus of the peptides; in 2E, at the 
carboxyl terminus. 
25 C, F: Noncovalent dimers. This Fc domain may be formed by 

elimination of the cysteinyl residues by either truncation or substitution. 
One may desire to eliminate the cysteinyl residues to avoid impurities 
formed by reaction of the cysteinyl residue with cysteinyl residues of other 



WO 00/24782 



PCT/US99/25044 



proteins present in the host cell. The noncovalent bonding of the Fc 
domains is sufficient to hold together the dimer. 

Other dimers may be formed by using Fc domains derived from different 
types of antibodies (e.g., IgG2, IgM). 
5 Figure 3 shows the structure of preferred compounds of the 

invention that feature tandem repeats of the pharmacologically active 
peptide. Figure 3A shows a single chain molecule and may also represent 
the DNA construct for the molecule. Figure 3B shows a dimer in which the 
linker-peptide portion is present on only one chain of the dimer. Figure 3C 

1 0 shows a dimer having the peptide portion on both chains. The dimer of 

Figure 3C will form spontaneously in certain host cells upon expression of 
a DNA construct encoding the single chain shown in Figure 3A. In other 
host cells, the cells could be placed in conditions favoring formation of 
dimers or the dimers can be formed in vitro . 

1 5 Figure 4 shows exemplary nucleic acid and amino acid sequences 

(SEQ ID NOS: 1 and 2, respectively) of human IgGl Fc that may be used in 
this invention. 

Figure 5 shows a synthetic scheme for the preparation of PEGylated 

peptide 19 (SEQ ID NO: 3). 
2 0 Figure 6 shows a synthetic scheme for the preparation of PEGylated 

peptide 20 (SEQ ID NO: 4). 

Figure 7 shows the nucleotide and amino acid sequences (SEQ ID 

NOS: 5 and 6, respectively) of the molecule identified as "Fc-TMP" in 

Example 2 hereinafter. 
2 5 Figure 8 shows the nucleotide and amino acid sequences (SEQ. ID. 

NOS: 7 and 8, respectively) of the molecule identified as "Fc-TMP-TMP" in 

Example 2 hereinafter. - - . 
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Figure 9 shows the nucleotide and amino acid sequences (SEQ. ID. 
NOS: 9 and 10, respectively) of the molecule identified as "TMP-TMP-Fc" 
in Example 2 hereinafter. 

Figure 10 shows the nucleotide and amino acid sequences (SEQ. ID. 
5 NOS: 11 and 12, respectively) of the molecule identified as "TMP-Fc" in 
Example 2 hereinafter. 

Figure 11 shows the number of platelets generated in vivo in 
normal female BDF1 mice treated with one 100 ng/kg bolus injection of 
various compounds, with the terms defined as follows. 
0 PEG-MGDF: 20 kD average molecular weight PEG attached by 

reductive amination to the N-terminal amino group of amino 
acids 1-163 of native human TPO, which is expressed in E. coli 
(so that it is not glycosylated); 
TMP: the TPO-mimetic peptide having the amino acid sequence 
L 5 IEGPTLRQWLAARA (SEQ ID NO: 13); 

TMP-TMP: the TPO-mimetic peptide having the amino acid 
sequence IEGPTLRQWLAARA-GGGGGGGG- 
IEGPTLRQWLAARA (SEQ ID NO: 14); 
PEG-TMP-TMP: the peptide of SEQ ID NO: 14, wherein the PEG 
2 o group is a 5 kD average molecular weight PEG attached as 

shown in Figure 6; 
Fc-TMP-TMP: the compound of SEQ ID NO: 8 (Figure 8) dimerized 
with an identical second monomer (i.e., Cys residues 7 and 10 
are bound to the corresponding Cys residues in the second 
2 5 monomer to form a dimer, as shown in Figure 2); and 

TMP-TMP-Fc is the compound of SEQ ID NO: 10 (Figure 9) 

dimerized in the same way as TMP-TMP-Fc except that the Fc 
domain is attached at the C-terminal end rather than the N- 
terminal end of the TMP-TMP peptide. 
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Figure 12 shows the number of platelets generated in vivo in 
normal BDF1 mice treated with various compounds delivered via 
implanted osmotic pumps over a 7-day period. The compounds are as 
defined for Figure 7. 
5 Figure 13 shows the nucleotide and amino acid sequences (SEQ. ID. 

NOS: 15 and 16, respectively) of the molecule identified as "Fc-EMP" in 
Example 3 hereinafter. 

Figure 14 shows the nucleotide and amino acid sequences (SEQ ID 
NOS: 17 and 18, respectively) of the molecule identified as "EMP-Fc" in 
1 0 Example 3 hereinafter. 

Figure 15 shows the nucleotide and amino acid sequences (SEQ ID 
NOS:19 and 20, respectively) of the molecule identified as "EMP-EMP-Fc" 
in Example 3 hereinafter. 

Figure 16 shows the nucleotide and amino acid sequences (SEQ ID 
1 5 NOS: 21 and 22, respectively) of the molecule identified as "Fc-EMP-EMP" 
in Example 3 hereinafter. 

Figures 17A and 17B show the DNA sequence (SEQ ID NO: 23) 
inserted into pCFM1656 between the unique Aat n (position #4364 in 
pCFM1656) and Sac II (position #4585 in pCFM1656) restriction sites to 
2 0 form expression plasmid pAMG21 (ATCC accession no. 98113). 

Figure 18A shows the hemoglobin, red blood cells, and hematocrit 
generated in vivo in normal female BDF1 mice treated with one 100 ng/kg 
bolus injection of various compounds. Figure 18B shows the same results 
with mice treated with 100 Mg/kg per day delivered the same dose by 7- 
2 5 day micro-osmotic pump with the EMPs delivered at 100 jig/kg, rhEPO at 
30U/mouse. (In both experiments, neutrophils, lymphocytes, and platelets 
were unaffected.) In these figures, the terms are defined as follows. 

Fc-EMP: the compound of SEQ ID NO: 16 (Figure 13) dimerized 

with an identical second monomer (i.e., Cys residues 7 and 10 are 
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bound to the corresponding Cys residues in the second monomer to 
form a dimer, as shown in Figure 2); 

EMP-Fc: the compound of SEQ ID NO: 18 (Figure 14) dimerized in 
the same way as Fc-EMP except that the Fc domain is attached at 
5 the C-terminal end rather than the N-terminal end of the EMP 

peptide. 

"EMP-EMP-Fc" refers to a tandem repeat of the same peptide (SEQ 
ID NO: 20) attached to the same Fc domain by the carboxyl 
terminus of the peptides. "Fc-EMP-EMP" refers to the same tandem 

1 o repeat of the peptide but with the same Fc domain attached at the 

amino terminus of the tandem repeat. All molecules are expressed 
in E. coli and so are not glycosylated. 

Figures 19A and 19B show the nucleotide and amino acid sequences 
(SEQ ID NOS: 1055 and 1056) of the Fc-TNF-a inhibitor fusion molecule 
1 5 described in Example 4 hereinafter. 

Figures 20A and 20B show the nucleotide and amino acid sequences 
(SEQ ID NOS: 1057 and 1058) of the TNF-a inhibitor-Fc fusion molecule 
described in Example 4 hereinafter. 

Figures 21A and 21B show the nucleotide and amino acid sequences 

2 0 (SEQ ID NOS: 1059 and 1060) of the Fc-IL-1 antagonist fusion molecule 

described in Example 5 hereinafter. 

Figures 22A and 22B show the nucleotide and amino acid sequences 
(SEQ ID NOS: 1061 and 1062) of the IL-1 antagonist-Fc fusion molecule 
described in Example 5 hereinafter. 
2 5 Figures 23 A, 23B, and 23C show the nucleotide and amino acid 

sequences (SEQ ID NOS: 1063 and 1064) of the Fc-VEGF antagonist fusion 
molecule described in Example 6 hereinafter. 

lb 
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Figures 24A and 24B show the nucleotide and amino acid sequences 
(SEQ ID NOS: 1065 and 1066) of the VEGF antagonist-Fc fusion molecule 
described in Example 6 hereinafter. 

Figures 25 A and 25B show the nucleotide and amino acid sequences 
5 (SEQ ID NOS: 1067 and 1068) of the Fc-MMP inhibitor fusion molecule 
described in Example 7 hereinafter. 

Figures 26A and 26B show the nucleotide and amino acid sequences 
(SEQ ID NOS: 1069 and 1070) of the MMP inhibitor-Fc fusion molecule 
described in Example 7 hereinafter. 
1 0 Detailed Description of the Invention 

Definition of Terms 

The terms used throughout this specification are defined as follows, 
unless otherwise limited in specific instances. 

The term "comprising" means that a compound may include 

1 5 additional amino acids on either or both of the N- or C- termini of the 
given sequence. Of course, these additional amino acids should not 
significantly interfere with the activity of the compound. 

The term "vehicle" refers to a molecule that prevents degradation 
and /or increases half-life, reduces toxicity, reduces immunogenicity, or 

2 0 increases biological activity of a therapeutic protein. Exemplary vehicles 
include an Fc domain (which is preferred) as well as a linear polymer (e.g., 
polyethylene glycol (PEG), polylysine, dextran, etc.); a branched-chain 
polymer (see, for example, U.S. Patent No. 4,289,872 to Denkenwalter et 
al., issued September 15, 1981; 5,229,490 to Tarn, issued July 20, 1993; WO 

2 5 93/21259 by Frechet et al ., published 28 October 1993); a lipid; a 

cholesterol group (such as a steroid); a carbohydrate or oligosaccharide; or 
any natural or synthetic protein, polypeptide or peptide that binds to a 
salvage receptor. Vehicles are further described hereinafter. 
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The term "native Fc" refers to molecule or sequence comprising the 
sequence of a non-antigen-binding fragment resulting from digestion of 
whole antibody, whether in monomelic or multimeric form. The original 
immunoglobulin source of the native Fc is preferably of human origin and 
5 may be any of the immunoglobulins, although IgGl and IgG2 are 

preferred. Native Fc's are made up of monomelic polypeptides that may 
be linked into dimeric or multimeric forms by covalent (i.e., disulfide 
bonds) and non-covalent association. The number of intermolecular 
disulfide bonds between monomelic subunits of native Fc molecules 

1 0 ranges from 1 to 4 depending on class (e.g., IgG, IgA, IgE) or subclass (e.g., 
IgGl, IgG2, IgG3, IgAl, IgGA2). One example of a native Fc is a disulfide- 
bonded dimer resulting from papain digestion of an IgG (see Ellison et al . 
(1982), Nucleic Acids Res . 10: 4071-9). The term "native Fc" as used herein 
is generic to the monomelic, dimeric, and multimeric forms. 

1 5 The term "Fc variant" refers to a molecule or sequence that is 

modified from a native Fc but still comprises a binding site for the salvage 
receptor, FcRn. International applications WO 97/34631 (published 25 
September 1997) and WO 96/32478 describe exemplary Fc variants, as 
well as interaction with the salvage receptor, and are hereby incorporated 

20 by reference. Thus, the term "Fc variant" comprises a molecule or 

sequence that is humanized from a non-human native Fc. Furthermore, a 
native Fc comprises sites that may be removed because they provide 
structural features or biological activity that are not required for the fusion 
molecules of the present invention. Thus, the term "Fc variant" comprises 

25 a molecule or sequence that lacks one or more native Fc sites or residues 
that affect or are involved in (1) disulfide bond formation, (2) 
incompatibility with a selected host cell (3) N-terminal heterogeneity upon 
expression in a selected host cell, (4) glycosylation, (5) interaction with 
complement, (6) binding to an Fc receptor other than a salvage receptor, or 
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(7) antibody-dependent cellular cytotoxicity (ADCC). Fc variants are 
described in further detail hereinafter. 

The term "Fc domain' 7 encompasses native Fc and Fc variant 
molecules and sequences as defined above. As with Fc variants and native 
5 Fc's, the term "Fc domain" includes molecules in monomelic or 

multimeric form, whether digested from whole antibody or produced by 
other means. 

The term "multimer" as applied to Fc domains or molecules 
comprising Fc domains refers to molecules having two or more 

1 0 polypeptide chains associated covalently, noncovalently, or by both 
covalent and non-covalent interactions. IgG molecules typically form 
dimers; IgM, pentamers; IgD, dimers; and IgA, monomers, dimers, 
trimers, or tetramers. Multimers may be formed by exploiting the 
sequence and resulting activity of the native Ig source of the Fc or by 

1 5 derivatizing (as defined below) such a native Fc. 

The term "dimer" as applied to Fc domains or molecules 
comprising Fc domains refers to molecules having two polypeptide chains 
associated covalently or non-covalently. Thus, exemplary dimers within 
the scope of this invention are as shown in Figure 2. 

2 0 The terms "derivatizing" and "derivative" or "derivatized" 

comprise processes and resulting compounds respectively in which (1) the 
compound has a cyclic portion; for example, cross-linking between 
cysteinyl residues within the compound; (2) the compound is cross-linked 
or has a cross-linking site; for example, the compound has a cysteinyl 

2 5 residue and thus forms cross-linked dimers in culture or in vivo; (3) one or 
more peptidyl linkage is replaced by a non-peptidyl linkage; (4) the N- 
terminus is replaced by -NRR\ NRC(0)R\ -NRC(0)OR-, -NRSCO)^ 1 , - 
NHC(0)NHR, a succinimide group, or substituted or unsubstituted 
benzyloxycarbonyl-NH-, wherein R and R 1 and the ring substituents are 
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as defined hereinafter; (5) the C-terminus is replaced by -C(0)R 2 or -NR*R 4 
wherein R 2 , R 3 and R 4 are as defined hereinafter; and (6) compounds in 
which individual amino acid moieties are modified through treatment 
with agents capable of reacting with selected side chains or terminal 
5 residues. Derivatives are further described hereinafter. 

The term "peptide" refers to molecules of 2 to 40 amino acids, with 
molecules of 3 to 20 amino acids preferred and those of 6 to 15 amino acids 
most preferred. Exemplary peptides may be randomly generated by any 
of the methods cited above, carried in a peptide library (e.g., a phage 

1 0 display library), or derived by digestion of proteins. 

The term "randomized" as used to refer to peptide sequences refers 
to fully random sequences (e.g., selected by phage display methods) and 
sequences in which one or more residues of a naturally occurring molecule 
is replaced by an amino acid residue not appearing in that position in the 

1 5 naturally occurring molecule. Exemplary methods for identifying peptide 
sequences include phage display, E. coli display, ribosome display, RNA- 
peptide screening, chemical screening, and the like. 

The term "pharmacologically active" means that a substance so 
described is determined to have activity that affects a medical parameter 

2 0 (e.g., blood pressure, blood cell count, cholesterol level) or disease state 
(e.g., cancer, autoimmune disorders). Thus, pharmacologically active 
peptides comprise agonistic or mimetic and antagonistic peptides as 
defined below. 

The terms "-mimetic peptide" and "-agonist peptide" refer to a 
2 5 peptide having biological activity comparable to a protein (e.g., EPO, TPO, 
G-CSF) that interacts with a protein of interest. These terms further 
include peptides that indirectly mimic the activity of a protein of interest, 
such as by potentiating the effects of the natural ligand of the protein of 
interest; see, for example, the G-CSF-mimetic peptides listed in Tables 2 
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and 7. Thus, the term "EPO-mimetic peptide'' comprises any peptides that 
can be identified or derived as described in Wrighton et aL (1996), Science 
273 : 458-63, Naranda etaL (1999), Proc. Natl. Acad. Sci. USA 96: 7569-74, 
or any other reference in Table 2 identified as having EPO-mimetic subject 
5 matter. Those of ordinary skill in the art appreciate that each of these 
references enables one to select different peptides than actually disclosed 
therein by following the disclosed procedures with different peptide 
libraries. 

The term "TPO-mimetic peptide" comprises peptides that can be 

1 0 identified or derived as described in Cwirla etal. (1997), Science 276: 1696- 
9 , U.S. Pat. Nos. 5,869,451 and 5,932,946 and any other reference in Table 2 
identifed as having TPO-mimetic subject matter, as well as the U.S. patent 
application, "Thrombopoietic Compounds/' filed on even date herewith 
and hereby incorporated by reference. Those of ordinary skill in the art 

1 5 appreciate that each of these references enables one to select different 
peptides than actually disclosed therein by following the disclosed 
procedures with different peptide libraries. 

The term "G-CSF-mimetic peptide" comprises any peptides that 
can be identified or described in Paukovits etal. (1984), Hoppe-Sevlers Z. 

2 0 Physiol. Chem . 365: 303-11 or any of the references in Table 2 identified as 
having G-CSF-mimetic subject matter. Those of ordinary skill in the art 
appreciate that each of these references enables one to select different 
peptides than actually disclosed therein by following the disclosed 
procedures with different peptide libraries. 

2 5 The term "CTLA4-mimetic peptide" comprises any peptides that 

can be identified or derived as described in Fukumoto etal. (1998), Nature 
Biotech . 16: 267-70. Those of ordinary skill in the art appreeiate-that each of 
these references enables one to select different peptides than actually 
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disclosed therein by following the disclosed procedures with different 
peptide libraries. 

The term "-antagonist peptide" or "inhibitor peptide" refers to a 
peptide that blocks or in some way interferes with the biological activity of 
5 the associated protein of interest, or has biological activity comparable to a 
known antagonist or inhibitor of the associated protein of interest Thus, 
the term "TNF-antagonist peptide" comprises peptides that can be 
identified or derived as described in Takasaki etal. (1997), Nature Biotech , 
15: 1266-70 or any of the references in Table 2 identified as having TNF- 
1 0 antagonistic subject matter. Those of ordinary skill in the art appreciate 
that each of these references enables one to select different peptides than 
actually disclosed therein by following the disclosed procedures with 
different peptide libraries. 

The terms "IL-1 antagonist" and "IL-lra-mimetic peptide" 
1 5 comprises peptides that inhibit or down-regulate activation of the IL-1 
receptor by IL-1. IL-1 receptor activation results from formation of a 
complex among IL-1, IL-1 receptor, and IL-1 receptor accessory protein. 
IL-1 antagonist or IL-lra-mimetic peptides bind to IL-1, IL-1 receptor, or 
IL-1 receptor accessory protein and obstruct complex formation among 
2 0 any two or three components of the complex. Exemplary IL-1 antagonist 
or IL-lra-mimetic peptides can be identified or derived as described in 
U.S. Pat. Nos. 5,608,035, 5,786331, 5,880,096, or any of the references in 
Table 2 identified as having IL-lra-mimetic or IL-1 antagonistic subject 
matter. Those of ordinary skill in the art appreciate that each of these 
2 5 references enables one to select different peptides than actually disclosed 
therein by following the disclosed procedures with different peptide 

libraries. ~ 

The term "VEGF-antagonist peptide" comprises peptides that can 
be identified or derived as described in Fairbrother (1998), Biochem. 37: 
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17754-64, and in any of the references in Table 2 identified as having 
VEGF-antagonistic subject matter. Those of ordinary skill in the art 
appreciate that each of these references enables one to select different 
peptides than actually disclosed therein by following the disclosed 
5 procedures with different peptide libraries. 

The term "MMP inhibitor peptide" comprises peptides that can be 
identified or derived as described in Koivunen (1999), Nature Biotech. 17: 
768-74 and in any of the references in Table 2 identified as having MMP 
inhibitory subject matter. Those of ordinary skill in the art appreciate that 

1 0 each of these references enables one to select different peptides than 
actually disclosed therein by following the disclosed procedures with 
different peptide libraries. 

Additionally, physiologically acceptable salts of the compounds of 
this invention are also encompassed herein. By "physiologically 

15 acceptable salts" is meant any salts that are known or later discovered to 
be pharmaceutically acceptable. Some specific examples are: acetate; 
trifluoroacetate; hydrohalides, such as hydrochloride and hydrobromide; 
sulfate; citrate; tartrate; glycolate; and oxalate. 
Structure of compounds 

2 0 In General . In the compositions of matter prepared in accordance 

with this invention, the peptide may be attached to the vehicle through the 
peptide's N-terminus or C-terminus. Thus, the vehicle-peptide molecules 
of this invention may be described by the following formula I: 
I 

2 5 (XVF 1 -(X 2 ) b 
wherein: 

F' is a vehicle (preferably an Fc domain); 

X' and X' are each independently selected from -(L l ) c -P\ -(L') c -P'- 
(L\ -P 2 , -a , ) t -P , -(LVP 2 -<LVP 3 , -(LVP'-aVP^L 3 ), -P'-ttA-P 4 
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P\ P 2 , P 3 , and P 4 are each independently sequences of 
pharmacologically active peptides; 

L\ V, L 3 , and L 4 are each independently linkers; and 
a, b, c, d, e, and f are each independently 0 or 1, provided that at 
5 least one of a and b is 1. 

Thus, compound I comprises preferred compounds of the formulae 

n 

X 1 -F 

and multimers thereof wherein F 1 is an Fc domain and is attached at the C- 
1 0 terminus of X'; 

m 

F-X 2 

and multimers thereof wherein F 1 is an Fc domain and is attached at the N- 
terminus of X 1 ; 
15 rv 

and multimers thereof wherein F 1 is an Fc domain and is attached at the N- 

terminus of -(V)-P'; and 

V 

20 FHLX-P'-dX-P 2 

and multimers thereof wherein F 1 is an Fc domain and is attached at the N- 
terminus of -L'-P'-lZ-P 2 . 

Peptides . Any number of peptides may be used in conjunction with 
the present invention. Of particular interest are peptides that mimic the 

2 5 activity of EPO, TPO, growth hormone, G-CSF, GM-CSF, IL-lra, leptin, 
CTLA4, TRAIL, TGF-a, and TGF-p. Peptide antagonists are also of 
interest, particularly those antagonistic to the activity of TNF, leptin, any 
of the interleukins (IL-1, 2, 3, ...), and proteins involved in complement 
activation (e.g., C3b). Targeting peptides are also of interest, including 

>H 
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tumor-homing peptides, membrane-transporting peptides, and the like. 
All of these classes of peptides may be discovered by methods described in 
the references cited in this specification and other references. 

Phage display, in particular, is useful in generating peptides for use 
5 in the present invention. It has been stated that affinity selection from 
libraries of random peptides can be used to identify peptide ligands for 
any site of any gene product. Dedman et al . (1993), I. Biol. Chem . 268: 
23025-30. Phage display is particularly well suited for identifying peptides 
that bind to such proteins of interest as cell surface receptors or any 

1 0 proteins having linear epitopes. Wilson et al. (1998), Can. T. Microbiol. 44: 
313-29; Kay et al . (1998), Drug Disc. Today 3: 370-8. Such proteins are 
extensively reviewed in Herz et al . (1997), T. Receptor & Signal 
Transduction Res . 17(5): 671-776, which is hereby incorporated by 
reference. Such proteins of interest are preferred for use in this invention. 

15 A particularly preferred group of peptides are those that bind to 

cytokine receptors. Cytokines have recently been classified according to 
their receptor code. See Inglot (1997), Archivum Immunologiae et 
Therapiae Experimen talis 45: 353-7, which is hereby incorporated by 
reference. Among these receptors, most preferred are the CKRs (family I in 

2 0 Table 3). The receptor classification appears in Table 3. 
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Table 3 — Cytokine Receptors Classified by Receptor Code 



Cytokines (ligands) 


I Receptor Type 


m ii 1~ M 11- - 

family subfamily 


family suofamiiy 


1. Hematopoietic 1. IL-2, IL-4 f IL-7, 
cytokines IL-9, IL-13. IL- 
1 5 

2. IL-3. IL-5, GM- 
pec 

3. IL-6JL-11JL- 
12, LIF, OSM t 
CNTF, leptin 
(UB) 

4. G-CSF. EPO, 
TPO, PRL, GH 

5. IL-17, HVS-IL- 
17 


I. Cytokine R 1 . shared yCr 
(CKR) 

2. shared GP 140 
prt 

3. 3.shared RP 
130 

4. "single chain" R 

5. other R° 


II. IL-10 ligands IL-10, BCRF-1, 
HSV-IL-10 


II. IL-10 R 


III. Interferons 1. IFN-al f a2, a4, 

m. t. IFN-p* 
2. IFN-y 


III. Interferon R 1. IFNAR 
2. IFNGR 


IV. IL-1 ligands 1. IL-1a, IL-1p, IL- 

1Ra 


IV. IL-1R 


V. TNF ligands 1. TNF-a.TNF-p 
(LT).FAS1. 
CD40 L, 
CD30L, CD27 L 


V. NGF/TNF R* 


VI. Chemokines 1. achemokines: 

IL-8, GRO 
c^p.y, IF-10, 
PF-4, SDF-1 

2. p chemokines: 
MIP1a, MIPip, 
MCP-1 ,2.3,4, 
RANTES, 
eotaxin 

3. y chemokines: 
lymphotactin 


VI. ChemokineR 1. CXCR 

2. CCR 

3. CR 

4. DARC' 



c 1L-17R belongs to the CKR family but is not assigned to any of the 4 indicated subjamilies. 
d Other IFN type I subtypes remain unassigned. Hematopoietic cytokines, IL-10 ligands and 
interferons do not possess functional intrinsic protein kinases. The signaling molecules for the 
cytokines are JAK's, STATs and related non-receptor molecules. IL-14, IL-16 and IL-18 have been 
cloned but according to the receptor code they remain unassigned. 

4 TNF receptors use multiple, distinct intracellular molecules for signal transduction including 
"death domain" of FAS R and 55 kDa TNF-aR that participates in their cytotoxic effects. NGF/TNF 
R can bind both NGF and related factors as well as TNF ligands. Chemokine receptors are G 
protein-coupled, seven transmembrane (7TM, serpentine) domain receptors. 

The Duffy blood group antigen (OARC) is an erythrocyte receptor that can bind several different 
chemokines. It belongs to the immunoglobulin superfamily but characteristics of its signal 
transduction events remain undear. 
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VII. Growth factors 



1.1 SCF, M-CSF, 



VII. RKF 



1. 
1.1 



TK sub-family 
IgTK III R 



PDGF-AA, AB ( 
BB, FLT-3L, 
VEGF, SSV- 
PDGF 



1.2 FGFa, FGF0 

1.3 EGF, TGF-a, 



W-F19(EGF- 
like) 



1.2 
1.3 



IgTK IV R 

Cysteine-rich 

TK-I 



1.4 IQF-I, IGF-II. 
Insulin 

1.5 NGF, BDNF, 
IMT-3, NT-4° 

2. TGF-p1,p2,33 



1.4 



Cysteine rich 
TK-II 



1.5 



Cysteine knot 
TK V 



2. 



STK subfamily" 



Exemplary peptides for this invention appear in Tables 4 through 
20 below. These peptides may be prepared by methods disclosed in the 
5 art. Single letter amino acid abbreviations are used. The X in these 

sequences (and throughout this specification, unless specified otherwise in 
a particular instance) means that any of the 20 naturally occurring amino 
acid residues may be present. Any of these peptides may be linked in 
tandem (i.e., sequentially), with or without linkers, and a few tandem- 

1 0 linked examples are provided in the table. Linkers are listed as "A" and 
may be any of the linkers described herein. Tandem repeats and linkers 
are shown separated by dashes for clarity. Any peptide containing a 
cysteinyl residue may be cross-linked with another Cys-containing 
peptide, either or both of which may be linked to a vehicle. A few cross- 

15 linked examples are provided in the table. Any peptide having more than 
one Cys residue may form an intrapeptide disulfide bond, as well; see, for 
example, EPO-mimetic peptides in Table 5. A few examples of 
intrapeptide disulfide-bonded peptides are specified in the table. Any of 
these peptides may be derivatized as described herein, and a few 

2 0 derivatized examples are provided in the table. Derivatizea peptides in 

Q The neurotrophic cytokines can associate with NGF/TNF receptors also. 
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the tables are exemplary rather than limiting, as the associated 
underivatized peptides may be employed in this invention, as well. For 
derivatives in which the carboxyl terminus may be capped with an amino 
group, the capping amino group is shown as -NHj. For derivatives in 
5 which amino acid residues are substituted by moieties other than amino 
acid residues, the substitutions are denoted by a, which signifies any of 
the moieties described in Bhatnagar et al . (1996), T. Med. Chem . 39: 3814-9 
and Cuthbertson et al . (1997), T. Med. Chem . 40: 2876-82, which are 
incorporated by reference. The J substituent and the Z substituents (Z^ TL^ 

10 . . .ZJ are as defined in U.S. Pat. Nos. 5,608,035 ,5,786331, and 5,880,096, 
which are incorporated by reference. For the EPO-mimetic sequences 
(Table 5), the substituents X^ through X„ and the integer "n" are as defined 
in WO 96/40772, which is incorporated by reference. The substituents 
"0," and "+" are as defined in Sparks et al . (1996), Proc. Natl. Acad. Sci . 93: 

1 5 1540-4, which is hereby incorporated by reference. X v X„ and X 7 are as 
defined in U.S. Pat. No. 5,773,569, which is hereby incorporated by 
reference, except that: for integrin-binding peptides, X„ X 2 , X y X«, X^ X^ X 7 , 
and X 8 are as defined in International applications WO 95/14714, 
published June 1, 1995 and WO 97/08203, published March 6, 1997, which 

2 0 are also incorporated by reference; and for VIP-mimetic peptides, X,, X/, 
X/', X 2 , X y X v Xj, X 6 and Z and the integers m and n are as defined in WO 
97/40070, published October 30, 1997, which is also incorporated by 
reference. Xaa and Yaa below are as defined in WO 98/09985, published 
March 12, 1998, which is incorporated by reference. AA„ AA^ AB„ AB 2 , 

2 5 and AC are as defined in International application WO 98/53842, 

published December 3, 1998, which is incorporated by reference. X 1 , X 2 , X 3 , 
and X 4 in Table 17 only are as defined in European application EP 0 911 



h STKS may encompass many other TGF-p-related factors that remain unassigned. The protein 
kinases are intrinsic part of the intracellular domain of receptor kinase family (RKF). The enzymes 
participate in the signals transmission via the receptors. 
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393, published April 28, 1999. Residues appearing in boldface are D- 
amino acids. All peptides are linked through peptide bonds unless 
otherwise noted. Abbreviations are listed at the end of this specification. In 
the "SEQ ID NO." column, "NR" means that no sequence listing is required 
5 for the given sequence. 



Table 4 — IL-1 antagonist peptide sequences 



Sequence/structure 


SEQ 
ID NO: 




212 


XXQZ,YZ«XX 


907 


Z^QZ^YZ^XX 


908 




909 


Z^QZ.YZ^ 


910 




917 


Z^NZ^Z^Z^Z^Z^Z^ 


979 


TANVSSFEWTPYYWQPYALPL 


213 


SWTDYGYWQPYALPISGL 


214 


ETPFTWEESNAYYWQPYALPL 


215 


ENTYSPNWADSMYWQPYALPL 


216 


SVGEDHNFWTSEYWQPYALPL 


217 


DGYDRWRQSGERYWQPYALPL 


218 


FEWTPGYWQPY 


219 


FEWTPGYWQHY 


220 


FEWTPGWYQJY 


221 


AcFEWTPGWYQJY 


222 


FEWTPGWpYQJY 


223 


FAWTPGYWQJY 


224 


FEWAPGYWQJY 


225 


FEWVPGYWQJY 


226 


FEWTPGYWQJY 


227 


AcFEWTPGYWQJY 


228 


FEWTPaWYQJY 


229 


FEWTPSarWYQJY 


230 


FEWTPGYYQPY 


231 


FEWTPGWWQPY 


232 | 


FEWTPNYWQPY 


233 


FEWTPvYWQJY 


234 


FEWTPecGYWQJY 


235 


FEWTPAibYWQJY 


236 


FEWTSarGYWQJY 


237 


FEWTPGYWQPY 


238 


FEWTPGYWQHY 


239 


FEWTPGWYQJY 


240 
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Merc VV 1 roWYvjJY 


241 


rCW 1 r O VV *p T -\J0 T 


OAO 

242 


FA\A/TP/^V\A/0_ IV 
rMW 1 rVaYWvjJY 


OA1 

243 


rpu/A DriVWA IV 
rtWMru Y WvJJ T 


1AA 

244 


rCWVruYWvJJY 


245 


rtW i r(jY WvJJY 


246 


A<»CC\A/TD/^\/lill^ IV/ 

ACrcWTPGYWQJY 


247 


r t WTP A VVYQJ Y 


248 


FEVVTPSarWYQJY 


249 


FEWTPGYYQPY 


250 


FEWTPGVVWQPY 


251 


FEVVTPNYWQPY 


252 


FEWTPVYWQJY 


253 


FEWTPecGYWOJY 


254 


FEVVTPAibYWQJY 


255 


FEWTSarGYWQJY 


256 


FEWTPGYWQPYALPL 


257 


1 NapEWTPGYYQJY 


258 


YEWTPGYYQJY 


259 


FEWVPGYYQJY 


260 


FEVVTPSYYQJY 


261 


FEVVTPNYYQJY 


262 


TKPR 


263 


RKSSK 


264 


RKQDK 


265 


NRKQDK 


266 


RKQDKR 


267 | 


ENRKQDKRF 


268 


VTKFYF 


269 


VTKFY 


270 


VTDFY 


271 


SHLYWQPYSVQ 


671 


TLVYWQPYSLQT 


672 


RGDYWQPYSVQS 


673 


VHVYWQPYSVQT 


674 


RLVYWQPYSVQT 


675 


SRWVFQPYSLQS 


676 


IMMV T WUr YolvJ 1 


o77 


SWFWQPYSVQT 


678 


TFVYWQPYALPL 


679 


TLVYWQPYSIQR 


680 


RLVYWQPYSVQR 


681 


SPVFWQPYSIQI 


682 


WIEWWQPYSVGS 


683 


SLIYWQPYSLQM 


684 


TRLYWQPYSVQR ~^ 


- 685 


RCDYWQPYSVQT 


686 


MRVFWQPYSVQN 


687 


KIVYWQPYSVQT 


688 


RHLYWQPYSVQR 


689 
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ALVWWQPYSEQ! 


690 


SRVWFQPYSLQS 


691 


WEQPYALPLE 


692 


QLVWWQPYSVQR 


693 


DLRYWQPYSVQV 


694 


ELVWWQPYSLQL 


695 


DLVWWQPYSVQW 


696 


NGNYWQPYSFQV 


697 


ELVYWQPYSIQR 


698 


ELMYWQPYSVQE 


699 


NLLYWQPYSMQD 


700 


GYEWYQPYSVQR 


701 ! 


SRVWYQPYSVQR 


702 


LSEQYQPYSVQR 


703 


GGGWWQPYSVQR 


1 704 


VGRWYQPYSVQR 


705 


VHVYWQPYSVQR 


706 


QARWYQPYSVQR 


707 


VHVYWQPYSVQT 


708 


RSVYWQPYSVQR 


709 


TRVWFQPYSVQR 


710 


GRIWFQPYSVQR 


711 


GRVWFQPYSVQR 


712 


ARTWYQPYSVQR 


713 


ARVWWQPYSVQM 


714 


RLMFYQPYSVQR 


715 


ESMWYQPYSVQR 


716 


HFGWWQPYSVHM 


717 


ARFWWQPYSVQR 


718 


RLVYWQ PYAPIY 


719 


RLVYWQ PYSYQT 


720 


RLVYWQ PYSLPI 


721 


RLVYWQ PYSVQA 


722 


SRVWYQ PYAKGL 


723 


SRVWYQ PYAQGL 


724 


SRVWYQ PYAMPL 


725 


SRVWYQ PYSVQA 


726 


SRVWYQ PYSLGL 


727 


SRVWYQ PYAREL 


728 


SRVWYQ PYSRQP 


729 


SRVWYQ PYFVQP 


730 


EYEWYQ PYALPL 


731 j 


IPEYWQ PYALPL 


732 


SRIWWQ PYALPL 


733 


DPLFWQ PYALPL 


734 


SRQWVQ PYALPL j 


735 


IRSWWQ PYALPL 


736 


RGYWQ PYALPL 


737 


RLLWVQ PYALPL 


738 


EYRWFQ PYALPL 


739 j 



31 



WO 00/24782 PCT/US99/25044 



HAV\AA/n OVA 1 Dl 


740 


\A/QHVPn PVAI PI 
WOO T rU r TMLrL 


741 


KJIPC\A/0 PVAI PI 
iNltrWU rTMLrL 




TDHVAA/rt DVAI Dl 

1 HUWVU rYALrL 




neeuuvn dvai di 
UooWYU rYALrL 


7AA 


ICaNWYU rYALrL 




All SUinA nv/A| Q| 

NLRWDQ PYALPL 


/4o 


LPEFWQ PYALPL 


TAT 


DSYWWQ PYALPL 


748 


RSQYYQ PYALPL 


749 


ARFWLQ PYALPL 


750 ! 


NSYFWQ PYALPL 


751 


RFMYWQPYSVQR 


752 


AHLFWQPYSVQR 


753 


WWQPYALPL 


754 


YYQPYALPL 


755 


YFQPYALGL 


756 


YWYQPYALPL 


757 j 


RWWQPYATPL 


758 


GWYQPYALGF 


759 ! 


YWYQPYALGL 


760 


IWYQPYAMPL 


761 


SNMQPYQRLS 


762 


TFVYWQPY AVGLPAAETACN 


763 


TFVYWQPY SVQMTITGKVTM 


764 


TFVYWQPY SSHXXVPXGFPL 


765 


TFVYWQPY YGNPQWAIHVRH 


766 


TFVYWQPY VLLELPEGAVRA 


767 


TFVYWQPY VDYVWPIPIAQV 


768 


GWYQPYVDGWR 


769 


RWEQPYVKDGWS 


770 


EWYQPYALGWAR 


771 


GWWQPYARGL 


772 


LFEQPYAKALGL 


773 


GWEQPYARGLAG 


774 


AWVQPYATPLDE 


775 


M WYQ P YSSQP AE 


776 


fiwTOPY^nnfiPv 

OW 1 Ur TOUUUCV 


777 


DWFQPYSIQSDE 


778 


PWIQPYARGFG 


779 


RPLYWQPYSVQV 


780 


TLIYWQPYSVQI 


781 


RFDYWQPYSDQT 


782 


WHQFVQPYALPL 


783 


EWDS VYWQPYSVQ TLLR 


784 


WEQN VYWQPYSVQ SFAD 


~ 785 


SDV VYWQPYSVQ SLEM 


786 | 


YYDG VYWQPYSVQ VMPA 


787 


SDIWYQ PYALPL 


788 


QRIWWQ PYALPL 


789 
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SRIWWO PYALPL 


790 


R^l YWO PYAI PI 


791 


TllWPn PYAI PL 


792 


WPTWYO PYAI PL 


793 


QVnWFO PYAI PI 


794 


QRIWPH PYAI Pt 


795 


FIMPWn PYAI PI 
CliVirWW t TnLrL 


796 


RYV/WOO PYAI PI 
UT V WWW « T MLrL 


797 


Mm 1 V/O WYOPYAI PI 


\ 798 


f^QkVII WVOPYAI PI 
V30IWIL WTUr T nLrL 


799 


POHAMI WYOPYAI PI 




PPr^nCD WVOPVAI PI 


801 


OAi CDT IA/VODVAI Dl 


OUZ 


CTtAA/DC lA/VHDVA 1 Dl 

t IWVnt WYUrYMLrL 




KISuo 1 w WYUrYALrL 


! QTM 


LQAnMIM WYQrYALrL 


one 
QUO 


EPRSQK WYQrYALrL 


ana 


VKQKWR WYQrYALrL 


Q/V7 

8U7 


LRRHDV WYQPYALPL 


O AO 

808 


RSTAS1 WYQPYALPL 




ESKEDQ WYQPYALPL 


810 


EGLTMK WYQPYALPL 


oil 


EGSREG WYQPYALPL 


812 


VIEWWQ PYALPL 


813 


VWYWEQ PYALPL 


814 


* nrtiAi/A nv/Al nr 

ASEWWQ PYALPL 


815 


FYEWWQ PYALPL 


OlD 


EGWWVQ PYALPL 


81/ 


I A//"* CI A/I /*> DX/AI Dl 

WGfcWLQ rYALrL 


fil ft 


nv\ni/co dvai Dl 
DYVWcU rYALrL 


RIO 


a LJT"VAAA//"*S DV/AI Dl 

AHTWWQ rYALrL 


R2ft \ 


CICVA/CO DVAI Dl 

rltWrU rYALrL 


821 i 


VAII A lA/po PYAI PI 
WLMWtU rTMLTL 




WMPXAAA/D PYAI PI 

VMtWWU rTMLrL _J 


823 


PDMWO PYAI PI 

tniviww rTMLrL 


824 


MYVWYY PYAI PI 


825 1 


WfiNWYO PYAI Pt 


826 j 


TLYWEQ PYALPL 


827 


VWRWEQ PYALPL 


828 


LLWTQ PYALPL 


829 


SRIWXX PYALPL 


830 


SDIWYQ PYALPL 


831 


WGYYXX PYALPL 


832 


TSGWYQ PYALPL 


833 


VHPYXX PYALPL 


834 


EHSYFQ PYALPL 


835 


XXIWYQ PYALPL 


836 


AQLHSQ PYALPL 


837 


WANWFQ PYALPL 


838 


SRLYSQ PYALPL 


839 
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GVTFSO PYALPL 


840 


9IVWSO PYALPL 


841 


^RDLVO PYALPL 


842 


HWfiH VYWOPY^VO DDLG 


843 


c\A/uc VYWOPY^VO <%\/PP 


844 


wnnQ wwnPYQvn pp^a 

WnUO V T VVVJr TOVU i tOW 


845 


TVA/nA wwnPY^vn kwi n 

1 VV L»M V T WVir T O VVW rVVVLU 


846 


tppw wwnPYQvn hp 

1 rrVf V T VVVJr T OVVJ OL4-/I 


847 


Y Woo VTWUrToVvJ OVPIO 


848 


VWV OPV Al f$l 


849 


VWV OPV Al PI 
YWT \Jr T MLrL 


850 


CWI OPV AT^I 


851 


KIVA/C ODV ALfDI 

INWfc VJrY AI\rL 


AS7 


ArY VJrY ALrL 


ft 1 ?** 
ooo 


CI V AQV Al Dl 

FLY UrY ALrL 


ASA 


VCK QrY LcWU 


ASS 

OOO 


CTDmA/CCCMAWUUODVAl PI 

bTrrTWbbbNAYYWUrYALrL 


ASA 


/"VMAII T\A/Or>C\/nkiiV\A/OPVAI PI 

LAjVVL 1 WUUoVUMTVVUrTALrL 


AS7 

OO/ 


CC C A ^* VTU/DC MTVWAD V A 1 PI 


ASA 


TCODm n\»/A 1/IVAA/rkDVAI PI 

1 fcorOU^UWAMYWLJr YMLrL 


ASQ 


nr»Vr\D\A/DnCr^CDV\A/nDVAI PI 


aaa 


TAM\/eCCC\A/TDr*\AA/ODVAI PI 

TAN VoorbW 1 ruYWUrTALrL 


AA1 


0\/OCnUMDA/TCC WAJODVAt PI 

oVotUniNlrVV 1 ob YWUrTMLrL 


ooz. 


iHMnnrccv/CTCD via/hdvai PI 
MNDQTSbVbl rr YWUrYAurL 


AA^ 


CVAfCCACCADDMI VU/HDVA 1 Dl 

bWbtArcUrnNL Y. WvJr YALrL 


AAA 


AVACDCAI RirMAir* V\A/HDV A 1 Dl 

UYAtroALNUVVo YWUrYALrL 


HAS 


kir»n\A/ATAn\A/OMV naa/hdvai pi 
NbUWAI AUWolNY YWvJrYALrL 


AAA 


TUnCUl WA/HDVAI Dl 

THDbHl YWUrYALrL 


RA7 
OO/ 


i ji ri/T\/T 1VA/TDO NAAiADVAl PI 

MLhK 1 Y 1 1 W 1 ro YWUrYALrL 


AAA 
ooo 


\AfCHDI TDRAHI NAA/APVA! PI 
WbUrL 1 HUAUL YWwrYALrL 


AAO 


CnACTTOncnAM VWODVAl pi 
oUAr 1 1 UUoUAIVI Y WUrYALrL 


A7fl 

O/xi 


pnnA A \ A/ DTP* CI T VU/OPVAI PI 
bUUAAWnlUoLI YWUrTMLrL 


A71 


AHROI VPWQFM VXA/OPYAI PI 


872 


CKITVCPKIlA/AnQM VVA/OPVAl PI 
tN 1 YorlMWAUoM TWUrTMLrL 




MMnOTQPV/QTPP YWOPYAI PI 
MINUUIOCVOlrr TWUrTMLrL 


874 


Q\/rjcnWNiPWTQP YWnPYAI PI 

OVUtUtllNr W 1 OC ifVUrTMLrL 


875 


nTDrrwppQMAY vwopyai pi 

KJ 1 rr I WCCoINM T TVVUr TMLrL 


A7A 


ENPFTWQESNAY YWQPYALPL 


877 


VTPFTWEDSNVF YWQPYALPL 


878 


QIPFTWEQSNAY YWQPYALPL 


879 


QAPLTWQESAAY YWQPYALPL 


880 


EPTFTWEESKAT YWQPYALPL 


881 


TTTLTWEESNAY YWQPYALPL 


882 


ESPLTWEESSAL YWQPYALPL 


883 


ETPLTWEESNAY YWQPYALPL 


884 


EATFTWAESNAY YWQPYALPL 


" 885 


EALFTWKESTAY YWQPYALPL 


886 


STP-TWEESNAY YWQPYALPL 


887 


ETPFTWEESNAY YWQPYALPL 


888 


KAPFTWEESQAY YWQPYALPL 


889 
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STSFTWEESNAY YWQPYALPL 


890 


DSTFTWEESNAY YWQPYALPL 


891 


YIPFTWEESNAY YWQPYALPL 


892 


QTAFTWEESNAY YWQPYALPL 


893 


CTI CTUUCCCU AT WA/ODVAI Dl 




VSSrTWccbNAY YWQPYALPL 


one 


QPYALPL 


Oft/ 


Py-1 -NapPYQJYALPL 


897 


TANVSSFEWTPG YWQPYALPL 


898 


FEWTPGYWQPYALPL 


899 


FEWTPGYWQJYALPL 


900 


FEWTPGYYQJYALPL 


901 


ETPFTWEESNAYYWQPYALPL 


902 


FTWEESNAYYWQJYALPL 


903 


ADVL YWQPYA PVTLWV 


! 904 


GDVAE YWQPYA LPLTSL 


905 


SWTDYG YWQPYA LPISGL 


906 


FEWTPGYWQPYALPL 


911 


FEWTPGYWQJYALPL 


[ 912 


FEWTPGWYQPYALPL 


913 


FEWTPGWYQJYALPL 


914 


FEWTPGYYQPYALPL 


915 


FEWTPGYYQJYALPL 


916 


TANVSSFEWTPGYWQPYALPL 


918 


S WTD YG YWQP YALP ISG L 


919 i 


ETPFTWEESNAYYWQPYALPL 


920 


ENTYSPNWADSMYWQPYALPL 


921 


SVGEDHNFWTSEYWQPYALPL 


922 


DG YD RWRQSGER YWQPYALPL 


923 


FEWTPGYWQPYALPL 


924 


FEWTPGYWQPY 


925 


FEWTPGYWQJY 


926 


EWTPGYWQPY 


927 


FEWTPGWYQJY 


928 


a rurrnovAA/A iv/ 

AEWTPGYWQJY 


929 


C A VA/TO AVIAI/^ IV/ 

FAWTPGYWQJY 


930 


FtATPGYWQJY 


931 






FEWTAGYWQJY 


933 


FEWTPAYWQJY 


934 


FEWTPGAWQJY 


935 


FEWTPGYAQJY 


936 


FEWTPGYWQJA 


937 


FEWTGGYWQJY 


938 


FEWTPGYWQJY 


939 


FEWTJGYWQJY 


940 


FEWTPecGYWQJY 


941 


FEWTPAibYWQJY 


942 


FEWTPSarWYQJY 


943 


FEWTSarGYWQJY 


944 
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FEWTPNYWQJY 


945 


FEWTPVYWQJY 


946 


FEWTVPYWQJY 


947 


AcFEWTPGWYOJY 


948 


AcFEWTPGYWQJY 


949 


INap-EWTPGYYQJY 


950 


YEWTPGYYQJY 


951 


FEWVPGYYQJY 


952 


FEWTPGYYOJY 


953 


FEWTPsYYQJY 


954 


FEWTPnYYQJY 


955 


SHLY-Nap-QPYSVQM 


956 


TLVY-Nap-QPYSLQT 


957 


RGDY-Nap-QPYSVQS 


958 


NMVY-Nap-QPYSIQT 


959 


VYWQPYSVQ 


960 


VY-Nap-QPYSVQ 


961 


TFVYWOJYALPL 


962 


FEWTPGYYQJ-Bpa 


963 


XaaFEWTPGYYQJ-Bpa 


964 


FEWTPGY-Bpa-OJY 


965 


AcFEWTPGY-Bpa-QJY 


966 


FEWTPG-Bpa-YQJY 


967 


AcFEWTPG-Bpa-YQJY 


968 


AcFE-Bpa-TPGYYQJY 


969 


AcFE-Bpa-TPGYYQJY 


970 


Bpa-EWTPGYYQJY 


971 


AcBpa-EWTPGYYQJY 


972 


VYWQPYSVQ 


973 


RLVYWQPYSVQR 


974 


RLVY-Nap-QPYSVQR 


975 


RLDYWQPYSVQR 


976 


RLVWFQPYSVQR 


977 


RLVYWQPYSIQR 


978 


DNSSWYDSFLL 


980 


DNTAWYESFLA 


981 


DNTAWYENFLL 


982 


PARE DNTAWYDSFLI WC 


983 


TSEY DNTTWYEKFLA SQ 


984 


SQIP DNTAWYQSFLL HG 


985 


SPFI DNTAWYENFLL TY 


986 


EQIY DNTAWYDHFLL SY 


987 


TPFI DNTAWYENFLL TY 


988 


TYTY DNTAWYERFLM SY 


989 


TMTQ DNTAWYENFLL SY 


990 


Tl DNTAWYANLVQ TYPQ 


~ 991 


Tl DNTAWYERFLA QYPD 


992 


HI DNTAWYENFLL TYTP 


993 | 


SQ DNTAWYENFLL SYKA 


994 


Ql DNTAWYERFLL QYNA 


995 
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NQ DN I AWYfcor LL UYN T 


996 


Tl fMIT AtAfX/CKICfl 1 Rll_lfc.ll 

Tl DNTAWYfcNrLL NHNL 


997 


HY DNTAWYERFLQ QGWH 


998 


rrnrriAfcrcKi a\aaa/ad\/ a i ni 

ETPFTWEtSNAYYWQPYALPL 


999 


YIPFTWEESNAYYWQPYALPL 


1000 


DGYDRWRQSGERYWQPYALPL 


1001 


pY-INap-pY-QJYALPL 


1002 


TANVSSFEWTPGYWQPYALPL 


1003 I 


FEWTPGYWQJYALPL 


1004 


FEWTPGYWQPYALPLSD 


1005 


FEWTPGYYCkiYALPL 


1006 


FEWTPGYWQJY 


1007 


AcFEWTPGYWQJY 


1008 


AcFEWTPGWYQJY 


1009 


AcFEWTPGYYQJY 


1010 


AcFEWTPaYWQJY 


1011 


AcFEWTPaWYQJY 


1012 


AcFEWTPaYYQJY 


1013 


FEWTPGYYCkiYALPL 


1014 


FEWTPGYWQJYALPL 


1015 


FEWTPGWYQJYALPL 


1016 


TANVSSFEWTPGYWQPYALPL 


1017 


AcFEWTPGYWQJY 


1018 


AcFEWTPGWYQJY 


1019 


AcFEWTPGYYQJY 


1020 


AcFEWTPAYWQJY 


1021 


AcFEWTPAWYQJY 


1022 


AcFEWTPAYYQJY 


1023 



31 
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Table 5— EPO-mimetic peptide sequences 



Sequence/structure 


SEQ 
ID NO: 


YXCXXGPXTWXCXP 


83 


YXCXXGPX7WXCXP-YXCXXGPXTWXCXP 


84 


YXHXXGPXTWXCXP-A-YXCXXGPXTWXCXP 


85 


VYf*YYf5PYT\A/Yf*YP A 

V (e-amine) 


OO 


> 

.PA 

YXCXXGPXTWXCXP-A- (a-amine) 


86 


GGTYSCHFGPL7WVCKPQGG 


87 


GGDYHCRMGPLTWVCKPLGG 


88 ; 


GGVYACRMGPITWVCSPLGG 


89 


VGNYMCHFGPITWVCRPGGG 


r 90 


GGLYLCRFGPVTWDCGYKGG 


91 


GGTYSCHFGPLTWVCKPQGG- 
GGTTSCnrGPLTwVCKPQGG 


92 


GGTYSCnrGPLTWVCKPQGvj -A- 
vjo I YoUnrbrL 1 WVUISrAjvao 




GfVTYQPWPGPI TWVPKPnGGQ^K 
vjivj3 i i Ownrur l i w v v r\ r v^\juwor\ 


QA 


GGTYSCHFGPLTWVCKPQGGSSK- 
GGTYSCHFGPLTWVCKPQGGSSK 


95 


GGTYSCHFGPLTWVCKPQGGSSK-A- 
GGTYSCHFGPLTWVCKPQGGSSK 


96 


0/%T\/PAIJCABI TIA/ \ i /"^IS n/^lt^ /"^ O 

GGTYSCHFGPLTWVCKPQGGSS V 

X (e-amine) 


97 


> 

.PA 

GGTYSCHFGPLTWVCKPQGGSS (a-anune) 


97 


GGTYSCHFGPLTV\A/CKPGGGSSK(-A-biotin) 


98 


CXXGPX fi TWX T C 


421 


GGTYSCHGPLTWVCKPQGG 


422 


VGNYMAHMGPITWVCRPGG 


423 


GGPHHVYACRMGPLTWIC 


424 


GGTYSCHFGPLTWVCKPQ 


425 


GGLYACHMGPMTWVCQPLRG 


426 


TIAQYICYMGPETWECRPSPKA 


427 


YSCHFGPLTWVCK 


428 


YCHFGPLTWVC 


429 


X^XGPXJWXX, 


124 


YXXXX.GPXJWX,X. 


461 
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419 


y yy rv y r?PY TWX CX X X 


420 


GGLYLCRFGPVTWDCGYKGG 


1024 


GGTYSCHFGPLTWVCKPQGG 


1025 


GGDYHCRMGPLTWVCKPLGG 


1026 


VGNYMCHFGPITWVCRPGGG 


1029 


GGVYACRMGPITWVCSPLGG 


1030 


VGNYMAHMGPITWVCRPGG 


1035 


GGTYSCHFGPLTWVCKPQ 


1036 


GGLYACHMGPMTWVCQPLRG 


1037 


Tl AQ Y IC YMG P ETWEC R PS PKA 


1038 


YSCHFGPLTWVCK 


1039 


YCHFGPLTWVC 


1040 


SCHFGPLTWVCK 


1041 


<AX,)JOCXGP)CrWX,X. 


1042 
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Table 6 — TPO-mimetic peptide sequences 



Sequence/structure 


SEQ 
ID NO: 


IEGPTLRQWLAARA 


13 ! 


IEGPTLRQWLAAKA 


24 


1 EG PTLR E WLAA RA 


25 


1 EG PTLRQ WLAARA- A- 1 EGPTLRQ WLAA RA 


26 


IEGPTLRQWLAAKA- A-IEGPTLRQWLAAKA 


27 


IEGPTLRQCLAARA-A-IEGPTLRQCLAARA 

1 I 


28 


IEGPTLRQWLAARA-A-K(BrAc)-A-IEGPTLRQWLAARA 


29 


IEGPTLRQWLAARA-A-K(PEG)-A-IEGPTLRQWLAARA 


30 


IEGPTLRQCLAARA-A-IEGPTLRQWLAARA 


31 


1 

IEGPTLRQCLAARA-A-IEGPTLRQWLAARA 


31 


1 EG PTLRQ WLAARA- A- 1 EG PTLRQCLAA RA 


32 


1 

1 ^ III! f— 1 s~\\ A /■ A A n A A 1 f~ |)|| r~>/~\^"» ■ a A n A 

1 EG PTLRQ WLAA R A- A - 1 EG PTLRQC LAA RA 


32 


VRDQIXXXL 


33 


TLREWL 


34 


GRVHDQVACaW 


35 


GRVKDUIAQL 


36 


oVHLaJVoVVAL 


57 


b O V H t U V M l\ Y 


3o 




^0 


va vnc i v t nnivi 


/in 


u VnCVIV fVi n FVI l_ 


41 


HRX/RnniWAAl ' 


49 


Aftx/Rnon iwi 

r\\J VnUVdllLIf IL 


4** 




44 


GRVRDQIfX^L 


45 | 


CTLRQWLQGC 


46 


CTLQEFLEGC 


47 


CTRTEWLHGC 


48 


CTLREWLHGGFC 


49 


CTLREWVFAGLC 


50 


CTLRQWLILLGMC 


51 


CTLAEFLASGVEQC 


52 


CSLQEFLSHGGYVC 


53 


CTLREFLDPTTAVC 


54 


CTLKEWLVSHEVWC 


55 1 


CTLREWL(X)„C 


56-60 


REGPTLRQWM 


61 


EGPTLRQWLA 


62 


ERGPFWAKAC i 


63 


REGPRCVMWM 


64 


CGTEG PTLSTWLDC i 


65 



Ho 
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CEQDGPTLLEWLKC 


66 


celvgpslmswltc 


67 


CLTGPFVTQWLYEC 


68 


C RAGPTLLE WLTLC 


69 


CADGPTLREWISFC 


1 70 


C(X), ,EGPTLREWL(X),*C 


71-74 


GGCTLREWLHGGFCGG 


75 


GGCADGPTLREWISFCGG 


76 


GNADGPTLRQWLEGRRPKN 


77 


LA1EGPTLRQWLHGNGRDT 


78 


HGRVGPTLREWKTQVATKK 


79 


TIKGPTLRQWLKSREHTS 


80 


ISDGPTLKEWLSVTRGAS 


81 


SIEGPTLREWLTSRTPHS 


82 
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Table 7 — G-CSF-mimetic peptide sequences 



Sequence/structure 


SEQ 


EEDCK 


QQ 




QQ 


! 1 

EEDCK 


99 


EEDaK 


100 


EEDaK 


100 


1 

EEDaK 


100 


pGluEDaK 


101 


pGluEDaK 
1 


101 


pGluEDaK 


101 


PicSDaK 


102 


PicSDoK 


102 


1 

PicSDaK 


102 


EEDCK-A-EEDCK 


103 


EEDXK-A-EEDXK 


104 
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Table 8 — TNF-antagonist peptide sequences 



Sequence/structure 


SEQ 
ID NO: 


YCFTASENHCY 


106 


YCFTNSENHCY 


i 107 


YCFTRSENHCY 


108 


FCASENHCY 


109 


YCASENHCY 


110 


FCNSENHCY 


111 


FCNSENRCY 


112 


FCNSVENRCY 


113 


YCSQSVSNDCF 


114 


FCVSNDRCY 


llD 


YCRKELGQVCY 


116 


YCKEPGQCY 


117 


YCRKEMGCY 


118 


FCRKEMGCY 


119 


YCWSQNLCY 


120 


YCELSQYLCY 


121 ! 


YCWSQNYCY 


122 ! 


YCWSQYLCY 


123 


DFLPHYKNTSLGHRP 


1085 


AA r AB, 

\ 

AC 

/ 

AA.-AB, 


NR 





Hi 
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Table 9— Integrin-binding peptide sequences 



Sequence/structure 


ID NO: 


RX.ETX.WX, 


441 


RX^TX.WX, 


442 


RGDGX 


443 


CRGDGXC 


444 




445 


CARRI DAPC 


446 




447 


X XXRGDXXX* 


448 


CX^CRGDCX^C 


449 


CDCRGDCFC 


450 


CDCRGDCLC 


451 


CLCRGDCIC 


452 


X X.DDX.X.XJC 


453 


X XJCDDX XJCXJC 


454 


CWDDGWLC 


455 


CWDDLWWLC 


456 


CWDDGLMC 


457 


CWDDGWMC 


458 


CSWDDGWLC 


459 


CPDDLWWLC 


460 


NGR 


NR 


GSL 


NR 


RGD 


NR 


CGRECPRLCQSSC 


! 1071 


CNGRCVSGCAGRC 


1072 


P.LSGSLSC 


1073 


RGD 


NR 


NGR 


NR 


GSL 


NR 


NGRAHA 


1074 


CNGRC 


1075 


CDCRGDCFC 


1076 


CGSLVRC 


1077 


DLXXL 


1043 


RTDLDSLRTYTL 


1044 


RTDLDSLRTY 


1053 


RTDLDSLRT 


1054 | 


RTDLDSLR 


1078 


GDLDLLKLRLTL 


1079 


GDLHSLRQLLSR 


1080 


RDDLHMLRLQLW 


1081 


SSDLHALKKRYG 


1082 


RGDLKQLSELTW 


1083 


RGDLAALSAPPV 


1084 
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Table 10 — Selectin antagonist peptide sequences 



Sequence/structure 


SEQ 
ID NO: 


DITWDQLWDLMK 


147 


DITWDELWKIMN 


148 


DYTWFE LWDMMQ 


149 


QITWAQLWNMMK 


150 


DMTWHDLWTLMS 


151 


DYSWHDLWEMMS 


152 


EITWDQLWEVMN 


153 


HVSWEOLWDIMN 


154 


HITWDOLWRIMT 


155 


RNM^WLEL WEHMK 


156 


APWTWnOI WHVMNIPAESO 
nut* l ft UUL t v n v ivii i r ribw>« 


157 


MRAFWI Al WFOMSP 1 


158 




159 


ITWDQLWDLMK 


160 


DITWDQLWDLMK 


161 


DITWDQLWDLMK 


162 


DITWDQLWDLMK 


163 


CQNRYTDLVAIQNKNE 


462 


AENWADNEPNNKRNNED 


463 


RKNNKTWTWVGTKKALTNE 


464 


KKALTNEAENWAD 


465 


CQXRYTDLVAIQNKXE 


466 


RKXNXXWTWVGTXKXLTEE 


467 


AENWADGEPNNKXNXED 


468 


CXXXYTXLVAIQNKXE 


469 


RKXXXXWXWVGTXKXLTXE 


470 


AXNWXXXEPNNXXXED 


471 


XKXKTXEAXNWXX 


472 



"1 
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Table 11 — Antipathogenic peptide sequences 



Sequence/structure 


SEQ I 


ID NO: 


f?FFALIPKIISSPLFKTLLSAVGSALSSSGGQQ 


503 ] 


ftFFAl IPKMSSPLFKTLLSAVGSALSSSGGQE 1 


504 


rsFFAl IPKII^RPLFKTLLSAV 


505 


ftppAi IPKIR^PLFKTLLSAV 


506 


L/r>pcAi IPKIIQQPL FKTLLSAV 


507 


ifW^FFAl IPkll^SPL FKTLLSAV 


508 


KkrtPFAl IPKII^PLFKTLLSAV 


509 


ocCAl lOLfllC 


510 


rM^AV/i tfv/i TTfJI PAI I^WIKRKROO 


511 




512 




513 


aiaawi t/\/i TTfSl PAI IQWIKR 


514 


A\#l l/l/l TTftl DAI IQ\A/il£R 

AVLKVLI loLrMLIoWiivn 


515 | 


l/l ■ 1 1 1 l/l 1 1 1 1/ 

KLLLLLKLLLLK 


516 


KLLLKLLLKLLIs 


517 


i/i ■ ■ i/i i/i i/i i i/ 

KLLLKLKLKLLK _ 


518 1 


i/i/i i i/i i/i i/i i/i^ 
KKLLKLKLl\Li\l\ 


519 1 


i/i ■ i i/i i i t/i i i/ 


520 i 


i/i i ■ t/i i/i i/i i i/ 
KLLLKLKLKLLK 


521 j 


KLLLLK 


522 


■/i i i i/i i i/ 


523 


i/i 1 1 i/i i/i i/i t i/ 
KLLLKLKLKLLK 


524 | 


i/i i i i/i i/i i/i i i/ 
KLLLKLKLKLLK 


525 


i/i i i i/i i/i i/i i i/ 

KLLLKLKLKLLK 


526 


l/A A Al/A A AI/AAI/ 


527 


l/V/\A/l/\AA/IA/\/k 


528 ! 


i/\j\/\/ i/w i/v/i^VV/ If 


529 


IfVAA/lfVlfX/lfVK 


530 




531 


i/i it |^| 


532 


L/Wl Ul 1 


533 


LKLRLL 


534 


KPLHLL 


535 


KLILKLVR 


536 


KVFHLLHL 


537 


HKFRILKL 


538 


KPFHILHL 


539 


KlflKIKKKIIK 


540 


KIIIKIKIKIIK 


541 


KI1IKIKIKIIK 


542 


KIPIKIKIKIPK 


543 


KIPIKIKIKIVK 


544 


RIIIRIR1RIIR 


545 


RIIIRIRIRIIR 


546 


RIIIR1RIRIIR 


547 


RIVIRIRIRLIR 


548 



% 
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DIIV/DIDI DIIQ 

HIIVnlnLnlln 




niolnLHVrillrt 




l/IV/IDIDIDI ID 

KIVIHlHInUH 


DDI 


RIAVKWRLHFIK 




KIGWKLRVRIIR 




KKIGWLIIRVRR 


CC>4 

554 


RIVIRIRIRLIRIR 




RIIVRIRLRIIRVR 


556 


RIGIRLRVRIIRRV 


557 


KIVIRIRARLIRIRIR 


558 


RIIVKIRLRIIKKIRL 


559 


KIGIKARVRIIRVKII 


560 


RIIVHIRLRIIHHIRL 


561 


HIGIKAHVRIIRVHII 


562 


RIYVKIHLRYIKKIRL 


563 


KIGHKARVHIIRYKII 


564 


RIYVKPHPRYIKKIRL 


565 


KPGHKARPHIIRYKII 


566 


KIVIRIRIRLIRIRIRKIV 


567 


RIIVKIRLRIIKKIRLIKK 


568 


KIGWKLRVRIIRVKIGRLR 


569 


KIVIRIRIRURIRIRKIVKVKRIR 


570 


RFAVKIRLRIIKKIRLIKKIRKRVIK 


571 


KAGWKLRVRIIRVKIGRLRKIGWKKRVRIK 


572 


RIYVKPHPRYIKKIRL 


573 


KPGHKARPHIIRYKII 


574 


KIVIRIRIRLIRIRIRKIV 


575 


RIIVKIRLRIIKKIRLIKK 


576 


RIYVSKISIYIKKIRL 


577 


KIVIFTRIRLTSIRIRSIV 


578 


KPIHKARPT1IRYKMI 


579 


cyclicCKGFFALIPKIISSPLFKTLLSAVC 


580 


CKKGFFALIPKIISSPLFKTLLSAVC 


581 


CKKKGFFALIPKIISSPLFKTLLSAVC 


582 


CyclicCRIVIRIRIRLIRIRC 


583 


CyclicCKPGHKARPHIIRYKIIC 


584 


CyclicCRFAVKIRLRIIKKIRLIKKIRKRVIKC 


585 


KLLLKLLL KLLKC 1 


586 


KLLLKLLLKLLK 


587 


KLLLKLKLKLLKC 


588 


KLLLKLLLKLLK 


589 



V 
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Table 12 — VIP-mimetic peptide sequences 



Sequence/structure 


SEQ 
ID NO: 


M^nAV/PYnNIYTR 1 RKOMAVKKYl N QILN 

nOUMVl I L/IN T in LniWdllVlrt Vf\l\ 1 OlUi 


590 


Mia UQnAVPVnNVTR 1 RkHMAV/kkYI W Qll N 
INie ngunVrTUliT in L.rlf\lJIVIMVr\f\ T LTM OIL-PI 


591 


V V ' Y " Y 


592 


Y C Y 1 Kl 


593 


mu r^u rn i^kvy*? mm r*w rn ya 

IMM \jri UU INIVTAO IMM v/rl V/U AO 

I | 


594 


1 1 
Vwilfc Jin c \\jiifcji i 




KKYI 
r\r\ i l. 


595 


MQII M 


596 


KKYI 

r\l\T L 


597 


l\l\T A 


598 


avi\i\yl 




MQII M 


fjr\r\ 


l^tf"V\/ 

I\i\yv 


OUl 


Oil aiiKI 




l/l/VI Ml a 




MO VI M 


OUT 


MCIVM 

NolYIM 




l/'l/VI DDKICII M 

KKYLrriMolLiM 


ovo 


LaUI\l\YL 




oapKKYL 


ADR 


l^VI 


IN IV. 


l\l\YiNIB 




V/KKVI 
V i\r\ T L. 


610 


1 MQII M 
LlMolLJN 


611 


VI MQII M 


612 


KKYI M 
r\r\ T L.IN 


613 


KKYI NQ 


614 


KKYI NQI 
r\r\ i uioi 


615 


KKYLNSIL ^ 

r\r\ i uiuiL 


616 


KKYL 


617 


KKYDA 


618 


AVKKYL 


619 


NSILN 


620 


KKYV 


621 j 


SILauN 


622 | 


NSYLN 


623 


NSIYN 


624 


KKYLNIe 


625 


KKYLPPNSILN 


626 


KKYL 


627 


KKYDA 


'628- 


AVKKYL 1 


629 


NSILN 


630 


KKYV 


631 


SILauN 


632 
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1 a. .L/l/X/l 

LauKKYL 




CapKKYL 




KYL 


INK 


KYL 


NR 


KKYNIe 


635 


VKKYL 


636 


LNSILN 


637 


YLNSILN 


638 


KKYLNIe 


I 639 


KKYLN 


640 


KKYLNS 


641 


KKYLNSI 


642 


KKYLNSIL 


643 


KKKYLD 


644 


cyclicCKKYLC 


645 


CKKYLK 

1 1 


646 


S-CH,-CO 




KKYA 


647 


WWTDTGLW 


648 


WWTDDGLW 


649 


WWDTRGLWVWT1 


650 


FWGNDGIWLESG 


651 


DWDQFGLWRGAA 


652 


R WDDNG LWVWL 


653 


SGMWSHYGIWMG 


654 


GGRWDQAGLWVA 


655 


KLWSEQG 1 WMGE 


656 


CWSMHGLWLC 


657 




DJO 


DWDTRGLWVY 


659 


SLWDENGAWI 


660 


KWDDRGLWMH 


661 


QAWNERGLWT 


662 


QWDTRG LWV A 


663 


WNVHGIWQE 


664 


SWDTRGLWVE 


665 


DWDTRGLWVA 


666 


SWGRDGLWIE 


667 


EWTDNGLWAL 


668 | 


SWDEKGLWSA 


669 


SWDSSGLWMD 


670 
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Table 13— Mdm/hdm antagonist peptide sequences 



Sequence/structure 


SEQ 


ID NO: 


TFSDLW 


130 


QETFSDLWKLLP 


131 


QPTFSDLWKLLP 


132 


QFTPQnvwkl l P 


133 


QPTFSDYWKLLP 


134 


MPRFMDYWEGLN 


135 


VQNFIDYWTQQF 


loo 


TGPAFTHYWATF 


137 


inDADTCDHUIWCAl W 


138 


qddai % /c a p\v\A/crn v 1 


139 


n a CCDQAfCni CA^AU 


140 


DACODQA/Cl^l QA^AM 


141 


DVCVr\\AA/YYI 


142 


QETFSDLWKLLP 


143 


ADTCCr\l \hllf\ 1 D 


144 


ACTCOPkNAA/l^l t D 


145 


ADTCOnV\A/l^l 1 D 


146 


TnVilo 14 (Calmodulin antagonist peptide 


» sequences 


Sequence/structure 


SEQ 


ID NO: 


cirvKWGKKEFCGS 


164 


QPWKYWfi KECGS 


165 


«?£V/FWGKLRWCGS 


166 | 


€>HI RWGKWSNCGS 


167 


SCWRWGKYQICGS 


168 


SCVSWGALKLCGS 


169 


SCIRWGQNTFCGS 


170 


SCWQWGNLKICGS 


171 


SCVRWGQLSICGS 


172 


LKKFNARRKLKGAILTTMLAK 


173 


RRWKKN Fl AVS AAN RFKK 


174 


RKWQKTGHAVRAIGRLSS 


175 


INLKALAALAKKIL 


176 


KIWSILAPLGTTLVKLVA 


177 


LKKLLKLLKKLLKL 


178 


LKWKKLLKLLKKLLKKLL 


! 179 


AEWPSLTEIKTLSHFSV 


180 


AEWPSPTRVISTTYFGS 


1 181 


AELAHWPPVKTVLRSFT 


182 


AEGSWLQLLNLMKQMNN 


183 


AEWPSLTEIK 


184 
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Table 15 — Mast cell antagonists/Mast cell protease inhibitor 



peptide sequences 



Sequence/structure 


SEQ 
ID NO: 


SGSGVLKRPLPILPVTR 


272 


RWLSSRPLPPLPLPPRT 


273 


GSGSYDTLALPSLPLHPMSS 


274 


GSGSYDTRALPSLPLHPMSS 


275 


GSGSSGVTMYPKLPPHWSMA 


276 


GSGSSGVRMYPKLPPHWSMA 


277 


GSGSSSMRMVPTIPGSAKHG 


278 


RNR 


NR 


QT 


NR 


RQK 


NR 


NRQ 


NR 


RQK 


NR 


RNRQKT 


436 


RNRQ 


437 


RNRQK 


438 


NRQKT 


439 


RQKT 


440 



SI 
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Table 16 — SH3 antagonist peptide sequences 



Sequence/structure 


SEQ 
ID NO: 


RPLPPLP 


282 


RELPPLP 


283 


SPLPPLP 1 


284 


GPLPPLP 


285 


RPLPIPP 


286 


RPLPIPP 


287 


RRI PPTP 

nnLr r 1 1 


288 


PHI PPTP 


289 


Dpi DCDD 

nrLrOnr 


290 


PPI PTRP 1 


291 


QQI DDI p 


292 


DAI PCPP 


293 


ppi PRTP 

nnLr n 1 r 


294 


QP Wpp IT 
nr Vr rl I 


295 


II APPV/P 
ILnrrVr 


296 


Dpi PMI P 


297 


PPI Pll P 


298 


ppl PQ| P 
nrLrOLr 


299 


ppi pci p 


300 


RPLPMIP 
nr i.r iviir 


301 


RPLPLIP 


302 


RPLPPTP 


303 


RSLPPLP 


304 


RPQPPPP 


305 


RQLPIPP 


306 


XXXRPLPPLPXP 


307 


XXXRPLPPIPXX 


308 


XXXRPLPPLPXX 


309 


RXXRPLPPLPXP 


310 


RXXRPLPPLPPP 


311 


PPPYPPPPIPXX 


312 


pppyppppvpxx 


313 


LXXRPLPX^P 


314 


^XXRPLPXLP 


315 


PPX0XPPPYP 


316 


+PP v rPXKPXWL 


317 


RPX^P^R+SXP 


318 


PPVPPRPXXTL 


319 


H'PTLPH'K 


320 


+0DXPLPXLP 


321 
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Table 17— Somatostatin or cortistatin mimetic peptide sequences 



Sequence/structure 


*>EO 

JCy 

ID NL/: ! 


X'-X a -Asn-Phe-Phe•Trp-Lys-mr-Phe-X J -Ser-X , 


473 


Asp Ara Met Pro Cvs Ara Asn Phe Phe Tro Lvs Thr pne ser ser uys Lys 


474 


Met Pro Cys Ara Asn Phe Phe Trp Lys Thr Phe Ser Ser Cvs Lys 


475 


Cvs Arq Asn Phe Phe Trp Lvs Thr Phe Ser Ser Cys Lys 


476 


Asp Arq Met Pro Cvs Arg Asn Phe Phe Tro Lvs Thr Phe Ser Ser Cys 


477 


Met Pro Cys Ara Asn Phe Phe Tro Lys Thr Phe Ser Ser Cvs 


478 


Cys Arg Asn Phe Phe Tro Lys Thr Phe Ser Ser Cys 


479 


Asp Arg Met Pro Cvs Lys Asn Phe Phe Trp Lvs Thr Phe Ser Ser Cys 


480 


Met Pro Cys Lvs Asn Phe Phe Trp Lys Thr Phe Ser Ser Cvs Lys 


481 


Cvs Lys Asn Phe Phe Trp Lvs Thr Phe Ser Ser Cys Lys 


482 


Asp Arg Met Pro Cvs Lys Asn Phe Phe Tro Lys Thr Phe Ser Ser Cys 


483 


Met Pro Cys Lvs Asn Phe Phe Trp Lys Thr Phe Ser Ser Cys 




Cvs Lys Asn Phe Phe Trp Lys Thr Phe Ser Ser Cys 


485 


Aso Arq Met Pro Cvs Arg Asn Phe Phe Trp Lys Thr Phe Thr Ser Cvs Lys 


486 


Met Pro Cys Ara Asn Phe Phe Trp Lys Thr Phe Thr Ser Cvs Lys 


487 


Cvs Arq Asn Phe Phe Trp Lvs Thr Phe Thr Ser Cys Lys 


| 488 


Asp Arq Met Pro Cys Arg Asn Phe Phe Trp Lvs Thr Phe Thr Ser Cys 


489 


Met Pro Cvs Arq Asn Phe Phe Trp Lvs Thr Phe Thr Ser Cys 


490 


Cys Arg Asn Phe Phe Trp Lys Thr Phe Thr Ser Cys 


i 491 


Asp Arq Met Pro Cys Lys Asn Phe Phe Tro Lys Thr Phe Thr Ser Cys Lys 


492 


Met Pro Cys Lvs Asn Phe Phe Trp Lys Thr Phe Thr Ser Cvs Lys 


493 


Cys Lys Asn Phe Phe Tro Lys Thr Phe Thr Ser Cys Lys 


494 


Asp Arg Met Pro Cvs Lys Asn Phe Phe Tro Lvs Thr Phe Thr Ser Cys 


495 


Met Pro Cvs Lys Asn Phe Phe Trp Lvs Thr Phe Thr Ser Cys 


496 


Cys Lys Asn Phe Phe Tro Lys Thr Phe Thr Ser Cys 


497 
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Table 18 — UKR antagonist peptide sequences 



Sequence/structure 


SEQ 
ID NO: 


ACDMDUQI NPQHYI VA/VT 
Mtr MrnoUNrOUT LVV T 1 


196 


ACUTVCCI UUHTVCDI AC 
Atn 1 TooLWU 1 TgrUM" 


197 

17/ 


A t LU L W Ivl n PI T r Lo r o in n 


198 

I/O 


Aceei \A/TDV A VA/DCMDCV 
AtooLW 1 nTMWroMroT 


199 

177 


AtWnrbLorboTLVVoN 1 




A t r ALUM W o r r r IN r va LTI 


£AJ 1 


AEWSrYNLrlLrhrvJl lr 




APPI ni W<5I Y^l PPL AM 


203 


AEPTLWQLYQFPLRLSG 


204 


AEISFSELMWLRSTPAF 


205 


AELSEADLWTTWFGMGS 


206 


AESSLWRIFSPSALMMS 


207 


AESLPTLTSILWGKESV 


208 


AETLFMDLWHDKHILLT 


209 


AE1LNFPLWHEPLWSTE 


210 


AESQTGTLNTLFWNTLR 


211 


AEPVYQYELDSYLRSYY 


430 


AELDLSTFYDIQYLLRT 


431 


AEFFKLGPNGYVYLHSA 


432 


FKLXXXGYVYL 


433 


AESTYHHLSLGYMYTLN 


434 


YHXLXXGYMYT 


435 
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Table 19 — Macrophage and/or 



T-cell inhibiting peptide sequences 



Sequence/structure 


SEO 
I LI inci: 


Xaa-Yaa-Arg 


NR 


Arg-Yaa-Xaa 


NR 


Xaa-Arg-Yaa - 


NR 


Yaa-Arg-Xaa . — - 


NR 


Ala-Arg 


NR 


Arg-Arg 


NR 


Asn-Arg 


NR 




NR 


Cys-Arg 


NR 


Gln-Arg 


NR 


GIVi-Arn 


NR 


Glv-Arq 


NR 


His-arg 


NR 


He-Arg 


NR 


1 eu-Arn 


NR 


1 vs-Ara 


NR 


Met-Arg 


NR 


Phe-Arg 


NR 1 


Ser-Arg 


NR 


Thr-Arg 


NR 


Trp-Arg 


NR 


Tyr-Arg 


NR 


Val-Arg 


NR 


Ala-Glu-Arg 


NR 


Arg-Glu-Arg 


NR 


Asn-Glu-Arg 


NR 


Asp-Glu-Arg 


NR 


Cys-Glu-Arg 


NR 


Gln-Gfu-Arg 


NR 


Glu-Glu-Arg 


NR 


Gly-Glu-Arg 


NR 


His-Glu-Arg 


NR 


lle-Glu-Arg 


NR 


Leu-Glu-Arg 


NR 


lys-Glu-Arg 


NR 


Met-Glu-Arg 


NR 


Phe-Glu-Arg 


NR 


Pro-Glu-Arg 


NR 


Ser-Glu-Arg 


- NR 


Thr-Glu-Arg 


NR 


Trp-Glu-Arg 


NR 


Tyr-Glu-Arg 


NR 


Val-Glu-Arg 


NR 
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Arg-Ala 


NK 


Arg-Asp 


NK 


Arg-Cys 


NK 


Arg-GIn 


NK 


Arg-Glu 


NK 


Arg-Gly 


NR 


Arg-His 


NR 


Arg-lle 


NR 


Arg-Leu 


NR 


Arg-Lys 


NR 


Arg-Met 


NR 


Arg-Phe 


NR 


Arg-Pro 


NR 


Arg-Ser 


NR 


Arg-Thr 


NR 


Arg-Trp . - 


NR 


Arg-Tyr 


NR 


Arg-Val 


NR 


Arg-Glu-Ala 


NR 


Arg-Glu-Asn 


NR 


Arg-Glu-Asp 


NR 


Arg-Glu-Cys 


NR 


Arg-Giu-GIn 


NR 


Arg-Glu-Glu 


NR 


Arg-Glu-Gly 


NR 


Arg-Glu-His 


NR 


Arg-Glu-lie 


NR 


Arg-Glu-Leu 


NR 


Arg-Glu-Lys 


NR 


Arg-Glu-Met 


NR 


Arg-Glu-Phe 


NR 


Arg-Glu-Pro 


NR 


Arg-Glu-Ser 


NR 


Arg-Glu-Thr 


NR 


Arq-Glu-Trp 


NR 


Arg-Glu-Tyr 


NR 


Arg-GIu-Val 


NR 


Ala-Arg-Glu 


NTR 

IN 1\ 


Arg-Arg-Glu 


NR 


Asn- Arg-Glu 


NR 


Asp-Arg-Glu 


NR 


Cys-Arq-Glu 


NR 


Gln-Arg-Glu 


NR 


Glu-Arg-Glu 


NR 


Gly-Arq-Glu 


NR 


His-Arg-Glu 


NR 


lle-Arg-Glu 


NR 


Leu-Arg-Glu 


NR 


Lys-Arg-G!u 


NR 


Met-Arg-Glu 


NR 
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pka. A rn-f^li I 


NR ! 


D rr\ _ A rn - 1 1 1 


NR 


Car A rn _ f"^ 1 1 1 


NR 


TU r A rn -fill i 


NR 


T rr\_ A m 1 1 1 


NR 


Tiif Am /"^lii 


NR 


\/n| Am f^lll 


NR ! 


(jlU-Arg-Ala, m , „^ — 


NR ! 


Gtu-Arg-Arg 


NR 


Glu-Arg-Asn 


NR 


Glu-Arg-Asp 


NR 


Giu-Arg-Cys 


NR 


Glu-Arg-GIn 


NR I 


nhi-Arn-Glv 


NR 


Glu-Arg-His 


NR I 


Glu-Arg-lte a . 


NR 


Glu-Arg-Leu _ 


NR 


G)u-Arg-Lys m - 


NR ! 


Glu-Arg-Met 


! NR 


Glu-Arg-Phe 


NR 


Glu-Arg-Pro a 


NR 


Glu-Arg-Ser 


NR 


Glu-Arg-Thr 


NR 


Glu-Arg-Trp 


NR 


Glu-Arg-Tyr 


NR 


Glu-Arg-Val 


NR 



SI 
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Table 20— Additional Exemplary Pharmacologically Active Peptides 



Sequence/structure 


SEQ 
ID 
Nth 


Activity 


VEPNCDIHVMWEWECFERL 


luz/ 


vtor-aniagonisi 


GERWCFDGPLTWVCGEES 


10o4 


vtijr-aniagonisi 


RGWVEICVADDNGMCVTEAQ 


IvJoo 


vtur* an tag on isi 


GWDECDVARMWEWECFAGV 


1086 


VcCar- antagonist 


G E R WC FDG P R A WVCG WE 1 


501 


vtur- antagonist 


EELWCFDGPRAWVCGYVK 


502 


VEGF- antagonist 


RGWVEICAADDYGRCLTEAQ 


1031 


VEGF- antagonist 


RGWVEICESDVWGRCL 


1087 


VEGF- antagonist 


RGWVEICESDVWGRCL 


1088 


VEGF- antagonist 


GGNECDIARMWEWECFERL 


1089 


VEGF- antagonist 


RGWVEICAADDYGRCL 


1090 


VEGF-antagonist 1 


CTTHWGFTLC 


1028 


ft Aft M • • 

MMP inhibitor 


CLRSGXGC 


1091 


MMP inhibitor 


CXXHWGFXXC 


1092 


MMP inhibitor 


CXPXC 


1093 


MMP inhibitor 


CRRHWGFEFC 


1094 


MMP inhibitor 


STTHWGFTLS 


1095 


MMP inhibitor 


CSLHWGFWWC 


1096 


CTLA4-mimetic 


GFVCSGIFAVGVGRC 


125 


CTLA4-mimetic ! 


APGVRLGCAVLGRYC 


126 


CTLA4-mimetic 


LLGRMK 


105 


Antiviral (HBV) 


ICVVQDWGHHRCTAGHMANLTSHASAl 


127 


C3b antagonist 


ICVVQDWGHHRCT 


128 


C3b antagonist 


CWQDWGHHAC 


129 


C3b antagonist 


STGGFDDVYDWARGVSSALTTTLVATR 


185 


Vinculin-binding 


STGGFDDVYDWARRVSSAL 1 1 ILVATR 


186 


Vinculin-binding 


SRGVNFSEWLYDMSAAMKEASNVFPSRRSR 


187 


Vinculin-binding 


SSQNWDMEAGVEDLTAAMLGLLSTIHSSSR 


188 


Vinculin-binding 


SSPSLYTQFLVNYESAATRIQDLLIASRPSR 


189 


Vinculin-binding 


SSTGWVDLLGALQRAADATR1 SIPPSLUNSH 




viiiwuuii un iu i • <y 


DVYTKKELIECARRVSEK 


191 


Vinculin-binding 


EKGSYYPGSGIAQFHIDYNNVS 


192 


C4BP-binding 


SG1AQFHIDYNNVSSAEGWHVN 


193 


C4BP-binding 


LVTVEKGSYYPGSGIAQFHIDYNNVSSAEGWHVN 


194 


C4BP-binding 


SGIAQFHIDYNNVS 


| 195 


C4BP-binding 


LLGRMK 


279 


anti-HBV 


ALLGRMKG 


280 


anti-HBV 


LDPAFR 


281 


anti-HBV 


CXXRGDC 


322 


Inhibition of platelet 
aaareaation 


RPLPPLP 


323 


Src antagonist 


PPVPPR 


324 


Src antagonist 


XFXDXWXXLXX 


325 


Anti-cancer 
(particularly for 
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sarcomas) 


KACRRLFGPVDSEQLSRDCD 


326 


p16-mimetic 


RERWNFDFVTETPLEGDFAW 


327 


p16-mimetic 


KRRQTSMTDFYHSKRRLIFS 


328 


p16-mimetic 


TSMTDFYHSKRRUFSKRKP 


329 


p16-mimetic 


RRLIF 


330 


p1 6-mimetic 


KRRQTSATDFYHSKRRLIFSRQIKIWFQNRRMKWKK 


331 


p1 6-mimetic 


KRRLIFSKRQIKIWFQNRRMKWKK 


332 


p1 6-mimetic 


A<;n Gtn Glv Ara His Phe Cvs Gtv Gty Ala Leu He His Ala 
Arq Phe Val Met Thr Ala Ala Ser Cvs Phe Gin 


498 


CAP37 mimetic/LPS 
binding 


Arg His Phe Cys Gly Gly Ala Leu ile His Ala Arg Phe Val 
Met Thr Ala Ala Ser Cys 


499 


CAP37 mimetic/LPS 
binding 


Gly Thr Arg Cys Gin Val Ala Gly Tip Gly Ser Gin Arg Ser 
Gly Gly Arg Leu Ser Arg Phe Pro Arg Phe Val Asn Val 


500 


CAP37 mimetic/LPS 
binding 


WHWRHRIPLQLAAGR 


1097 


carbohydrate (GD1 
alpha) mimetic 


LKTPRV 


1098 


p2GPl Ab binding 


NTLKTPRV 


1099 


32GPI Ab binding 


NTLKTPRVGGC 


1100 


02GPI Ab binding 


KDKATF 


1101 


02GPI Ab binding 


KDKATFGCHD 


1102 


p2GPi ad binding 


KDKATFGCHDGC 


1103 


02GPI Ab binding 


TLRVYK 


1104 


02GPI Ab binding 


ATLRVYKGG 


1105 


02GPI Ab binding 


CATLRVYKGG 


1106 


P2GPI Ab binding I 


INLKALAALAKKIL 


1107 


Membrane- 
transporting 


GWT 


NR 


Membrane- 
transporting 


GWTLNSAGYLLG 


1108 


Membrane- 
transporting 


GWTLNSAGYLLGKINLKALAALAKKIL 


1109 


Membrane- 
transporting 



having activity in treatment of: 

• cancer, wherein the peptide is a VEGF-mimetic or a VEGF receptor 
antagonist, a HER2 agonist or antagonist, a CD20 antagonist and the 
like; 

• asthma, wherein the protein of interest is a CKR3 antagonist, an IL-5 
receptor antagonist, and the like; 

• thrombosis, wherein the protein of interest is a GPIIb antagonist, a 
GPffla antagonist, and the like; 

f1 
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• autoimmune diseases and other conditions involving immune 
modulation, wherein the protein of interest is an IL-2 receptor 
antagonist, a CD40 agonist or antagonist, a CD40L agonist or 
antagonist, a thymopoietin mimetic and the like. 

Vehicles . This invention requires the presence of at least one vehicle 
(F 1 , F 2 ) attached to a peptide through the N-terminus, C-terminus or a 
sidechain of one of the amino acid residues. Multiple vehicles may also be 
used; e.g., Fc's at each terminus or an Fc at a terminus and a PEG group at 
the other terminus or a sidechain. 

An Fc domain is the preferred vehicle. The Fc domain may be fused 
to the N or C termini of the peptides or at both the N and C termini. For 
the TPO-mimetic peptides, molecules having the Fc domain fused to the N 
terminus of the peptide portion of the molecule are more bioactive than 
other such fusions, so fusion to the N terminus is preferred. 

As noted above, Fc variants are suitable vehicles within the scope of 
this invention. A native Fc may be extensively modified to form an Fc 
variant in accordance with this invention, provided binding to the salvage 
receptor is maintained; see, for example WO 97/34631 and WO 96/32478. 
In such Fc variants, one may remove one or more sites of a native Fc that 
provide structural features or functional activity not required by the 
fusion molecules of this invention. One may remove these sites by, for 
example, substituting or deleting residues, inserting residues into the site, 
or truncating portions containing the site. The inserted or substituted 
residues may also be altered amino acids, such as peptidomimetics or D- 
amino acids. Fc variants may be desirable for a number of reasons, several 
of which are described below. Exemplary Fc variants include molecules 
and sequences in which: 

1. Sites involved in disulfide bond formation are removed. Such removal 
may avoid reaction with other cysteine-containing proteins present in 

fr6 
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the host cell used to produce the molecules of the invention. For this 
purpose, the cysteine-containing segment at the N-terminus may be 
truncated or cysteine residues may be deleted or substituted with other 
amino acids (e.g., alanyl, seryl). In particular, one may truncate the N- 
5 terminal 20-amino acid segment of SEQ ID NO: 2 or delete or 

substitute the cysteine residues at positions 7 and 10 of SEQ ID NO: 2. 
Even when cysteine residues are removed, the single chain Fc domains 
can still form a dimeric Fc domain that is held together non-covalently. 

2. A native Fc is modified to make it more compatible with a selected host 
1 0 cell. For example, one may remove the PA sequence near the N- 

terminus of a typical native Fc, which may be recognized by a digestive 
enzyme in E. coli such as proline iminopeptidase. One may also add an 
N-terminal methionine residue, especially when the molecule is 
expressed recombinantly in a bacterial cell such as E. coli . The Fc 
1 5 domain of SEQ ID NO: 2 (Figure 4) is one such Fc variant. 

3. A portion of the N-terminus of a native Fc is removed to prevent N- 
terminal heterogeneity when expressed in a selected host cell. For this 
purpose, one may delete any of the first 20 amino acid residues at the 
N-terminus, particularly those at positions 1, 2, 3, 4 and 5. 

20 4. One or more glycosylation sites are removed. Residues that are 

typically glycosylated (e.g., asparagine) may confer cytolytic response. 
Such residues may be deleted or substituted with unglycosylated 
residues (e.g., alanine). 
5. Sites involved in interaction with complement, such as the Clq binding 

2 5 site, are removed. For example, one may delete or substitute the EKK 

sequence of human IgGl. Complement recruitment may not be 
advantageous for the molecules of this invention and samay be 
avoided with such an Fc variant. 
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10 



15 



20 



25 



6. Sites are removed that affect binding to Fc receptors other than a 
salvage receptor. A native Fc may have sites for interaction with 
certain white blood cells that are not required for the fusion molecules 
of the present invention and so may be removed. 

7. The ADCC site is removed. ADCC sites are known in the art; see, for 
example, MoW. Immunol . 29 (5): 633-9 (1992) with regard to ADCC 
sites in IgGl. These sites, as well, are not required for the fusion 
molecules of the present invention and so may be removed. 

8. When the native Fc is derived from a non-human antibody, the native 
Fc may be humanized. Typically, to humanize a native Fc, one will 
substitute selected residues in the non-human native Fc with residues 
that are normally found in human native Fc. Techniques for antibody 
humanization are well known in the art. 

Preferred Fc variants include the following. In SEQ ID NO: 2 
(Figure 4) the leucine at position 15 may be substituted with glutamate; the 
glutamate at position 99, with alanine; and the lysines at positions 101 and 
103, with alanines. In addition, one or more tyrosine residues can be 
replaced by phenyalanine residues. 

An alternative vehicle would be a protein, polypeptide, peptide, 
antibody, antibody fragment, , or small molecule (e.g., a peptidomimetic 
compound) capable of binding to a salvage receptor. For example, one 
could use as a vehicle a polypeptide as described in U.S. Pat. No. 5,739,277, 
issued April 14, 1998 to Presta etal. Peptides could also be selected by 
phage display for binding to the FcRn salvage receptor. Such salvage 
receptor-binding compounds are also included within the meaning of 
"vehicle" and are within the scope of this invention. Such vehicles should 
be selected for increased half-life (e.g., by avoiding sequences recognized 
by proteases) and decreased immunogenicity (e.g., by favoring non- 
immunogenic sequences, as discovered in antibody humanization). 
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As noted above, polymer vehicles may also be used for F' and ¥\ 
Various means for attaching chemical moieties useful as vehicles are 
currently available, see, e.g., Patent Cooperation Treaty ("PCT") 
International Publication No. WO 96/11953, entitled "N-Terminally 
5 Chemically Modified Protein Compositions and Methods," herein 

incorporated by reference in its entirety. This PCT publication discloses, 
among other things, the selective attachment of water soluble polymers to 
the N-terminus of proteins. 

A preferred polymer vehicle is polyethylene glycol (PEG). The PEG 
1 0 group may be of any convenient molecular weight and may be linear or 
branched. The average molecular weight of the PEG will preferably range 
from about 2 kiloDalton ("kD") to about 100 kDa, more preferably from 
about 5 kDa to about 50 kDa, most preferably from about 5 kDa to about 
10 kDa. The PEG groups will generally be attached to the compounds of 
15 the invention via acylation or reductive alkylation through a reactive 

group on the PEG moiety (e.g., an aldehyde, amino, thiol, or ester group) 
to a reactive group on the inventive compound (e.g., an aldehyde, amino, 
or ester group). 

A useful strategy for the PEGylation of synthetic peptides consists 
20 of combining, through forming a conjugate linkage in solution, a peptide 
and a PEG moiety, each bearing a special functionality that is mutually 
reactive toward the other. The peptides can be easily prepared with 
conventional solid phase synthesis (see, for example, Figures 5 and 6 and 
the accompanying text herein). The peptides are "preactivated" with an 
2 5 appropriate functional group at a specific site. The precursors are purified 
and fully characterized prior to reacting with the PEG moiety. Ligation of 
the peptide with PEG usually takes place in aqueous phase and can be 
easily monitored by reverse phase analytical HPLC. The PEGylated 
peptides can be easily purified by preparative HPLC and characterized by 
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analytical HPLC, amino acid analysis and laser desorption mass 
sp ectrometry . 

Polysaccharide polymers are another type of water soluble polymer 
which may be used for protein modification. Dextrans are polysaccharide 
5 polymers comprised of individual subunits of glucose predominantly 
linked by al-6 linkages. The dextran itself is available in many molecular 
weight ranges, and is readily available in molecular weights from about 1 
kD to about 70 kD. Dextran is a suitable water soluble polymer for use in 
the present invention as a vehicle by itself or in combination with another 

1 o vehicle (e.g., Fc). See, for example, WO 96/11953 and WO 96/05309. The 

use of dextran conjugated to therapeutic or diagnostic immunoglobulins 
has been reported; see, for example, European Patent Publication No. 0 
315 456, which is hereby incorporated by reference. Dextran of about 1 kD 
to about 20 kD is preferred when dextran is used as a vehicle in 
1 5 accordance with the present invention. 

Linkers . Any 'Tinker" group is optional. When present, its chemical 
structure is not critical, since it serves primarily as a spacer. The linker is 
preferably made up of amino acids linked together by peptide bonds. 
Thus, in preferred embodiments, the linker is made up of from 1 to 20 

2 0 amino acids linked by peptide bonds, wherein the amino acids are selected 

from the 20 naturally occurring amino acids. Some of these amino acids 
may be glycosylated, as is well understood by those in the art. In a more 
preferred embodiment, the 1 to 20 amino acids are selected from glycine, 
alanine, proline, asparagine, glutamine, and lysine. Even more preferably, 
2 5 a linker is made up of a majority of amino acids that are sterically 

unhindered, such as glycine and alanine. Thus, preferred linkers are 
polyglycines (particularly (Gly),, (Gly)^, poly(Gly-Ala), and polyalanines. 
Other specific examples of linkers are: 

(Gly) 3 Lys(Gly) 4 (SEQ ID NO: 333); 
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(Gly),AsnGlySer(Gly) 2 (SEQ ID NO: 334); 
(Gly),Cys(Gly). (SEQ ID NO: 335); and 
GlyProAsnGlyGly (SEQ ID NO: 336). 
To explain the above nomenclature, for example, (Gly) 3 Lys(Gly) 4 means 
5 Gly-Gly-Gly-Lys-Gly-Gly-Gly-Gly. Combinations of Gly and Ala are also 
preferred. The linkers shown here are exemplary; linkers within the scope 
of this invention may be much longer and may include other residues. 

Non-peptide linkers are also possible. For example, alkyl linkers 
such as -NFHCHA-aO)-, wherein s = 2-20 could be used. These alkyl 
1 o linkers may further be substituted by any non-sterically hindering group 
such as lower alkyl (e.g., C,-Q) lower acyl, halogen (e.g., CI, Br), CN, NH,, 
phenyl, etc. An exemplary non-peptide linker is a PEG linker, 
VI 

O 



15 




wherein n is such that the linker has a molecular weight of 100 to 5000 kD, 
preferably 100 to 500 kD. The peptide linkers may be altered to form 
derivatives in the same manner as described above. 

Derivatives . The inventors also contemplate derivatizing the 
2 0 peptide and/or vehicle portion of the compounds. Such derivatives may 
improve the solubility, absorption, biological half life, and the like of the 
compounds. The moieties may alternatively eliminate or attenuate any 
undesirable side-effect of the compounds and the like. Exemplary 
derivatives include compounds in which: 
25 1 . The compound or some portion thereof is cyclic. For example, the 

peptide portion may be modified to contain two or more Cys residues 
(e.g., in the linker), which could cydize by disulfide bond formation. 
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For citations to references on preparation of cyclized derivatives, see 
Table 2. 

2. The compound is cross-linked or is rendered capable of cross-linking 
between molecules. For example, the peptide portion may be modified 



intermolecular disulfide bond with a like molecule. The compound 
may also be cross-linked through its C-terminus, as in the molecule 
shown below. 
VII 



4 . One or more peptidyl [-C(0)NR-] linkages (bonds) is replaced by a 
non-peptidyl linkage. Exemplary non-peptidyl linkages are -CH 2 - 
carbamate [-CH 2 -OC(0)NR-L phosphonate , -CH 2 -sulfonamide [-CHL,- 
SCOyMR-L urea [-NHC(0)NH-1, -CH^-secondary amine, and alkylated 
peptide [-C(0)NR*- wherein R 6 is lower alkyl]. 

5. The N-terminus is derivatized. Typically, the N-terminus may be 
acylated or modified to a substituted amine. Exemplary N-terminal 
derivative groups include -NRR 1 (other than -NH 2 ), -NRC(0)R\ 
-NRCCOOR 1 , -NRStO)^ 1 , -NHCtONHR 1 , succinimide, or 
benzyloxycarbonyl-NH- (CBZ-NH-), wherein R and R 1 are each 
independently hydrogen or lower alkyl and wherein the phenyl ring 
may be substituted with 1 to 3 substituents selected from the group 
consisting of C,-C 4 alkyl, C.-C, alkoxy, chloro, and bromo. 

6. The free C-terminus is derivatized. Typically, the C-terminus is 
esterified or amidated. For example, one may use methods described in 
the art to add (NH-CH 2 -CH 2 -NH 2 ) 2 to compounds of this invention 



to contain one Cys residue and thereby be able to form an 
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having any of SEQ ID NOS: 504 to 508 at the C-terminus. Likewise, 
one may use methods described in the art to add -NH, to compounds 
of this invention having any of SEQ ID NOS: 924 to 955, 963 to 972, 
1005 to 1013, or 1018 to 1023 at the C-terminus. Exemplary C-terminal 
5 derivative groups include, for example, -C(0)R J wherein R 2 is lower 

alkoxy or -NR 3 R 4 wherein R 3 and R 4 are independently hydrogen or C,- 
C g alkyl (preferably C,-C 4 alkyl). 

7. A disulfide bond is replaced with another, preferably more stable, 
cross-linking moiety (e.g., an alkylene). See, e.g., Bhatnagar etaL 

1 o (1996), T- Med. Chem . 39: 3814-9; Alberts etal. (1993) Thirteenth Am. 

Pep. Svmp ., 357-9. 

8. One or more individual amino add residues is modified. Various 
derivatizing agents are known to react specifically with selected 
sidechains or terminal residues, as described in detail below. 

1 5 Lysinyl residues and amino terminal residues may be reacted with 

succinic or other carboxylic acid anhydrides, which reverse the charge of the 
lysinyl residues. Other suitable reagents for derivatizing alpha-amino- 
containing residues include imidoesters such as methyl picolinimidate; 
pyridoxal phosphate; pyridoxal; chloroborohydride; trinitrobenzenesulfonic 

20 acid; O-methylisourea; 2,4 pentanedione; and transaminase-catalyzed reaction 

with glyoxylate. 

Arginyl residues may be modified by reaction with any one or 
combination of several conventional reagents, including phenylglyoxal, 2,3- 
butanedione, 1,2-cyclohexanedione, and ninhydrin. Derivatization of arginyl 
2 5 residues requires that the reaction be performed in alkaline conditions because 
of the high pKa of the guanidine functional group. Furthermore, these reagents 
may react with the groups of lysine as well as the argmine-epsilon-amino - 
group. 
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Specific modification of tyrosyl residues has been studied extensively, 
with particular interest in introducing spectral labels into tyrosyl residues by 
reaction with aromatic diazonium compounds or tetranitromethane. Most 
commonly, N-acetylimidizole and tetranitromethane are used to form O-acetyl 

5 tyrosyl species and 3-nitro derivatives, respectively. 

Carboxyl sidechain groups (aspartyl or glutamyl) may be selectively 
modified by reaction with carbodiimides (R-N=C=N-R') such as 1-cyclohexyl- 
3-(2-morpholinyl-(4-ethyl) carbodiimide or l-ethyl-3-(4-azonia-4,4- 
dimethylpentyl) carbodiimide. Furthermore, aspartyl and glutamyl residues 

10 may be converted to asparaginyl and glutaminyl residues by reaction with 

ammonium ions. 

Glutaminyl and asparaginyl residues may be deamidated to the 
corresponding glutamyl and aspartyl residues. Alternatively, these residues 
are deamidated under mildly acidic conditions. Either form of these residues 
1 5 falls within the scope of this invention. 

Cysteinyl residues can be replaced by amino acid residues or other 
moieties either to eliminate disulfide bonding or, conversely, to stabilize cross- 
linking. See, e.g., Bhatnagar etal. (1996), T-Med.Chem, 39: 3814-9. 

Derealization with bifunctional agents is useful for cross-linking the 
2 0 peptides or their functional derivatives to a water-insoluble support matrix or 
to other macromolecular vehicles. Commonly used cross-linking agents 
include, e.g., l,l-bis(diazoacetyl)-2-phenylethane, glutaraldehyde, N- 
hydroxysuccinimide esters, for example, esters with 4-azidosalicylic acid, 
homobifunctional imidoesters, including disuccinimidyl esters such as 3,3'- 
2 5 dithiobis(succinimidylpropionate), and bifunctional maleimides such as bis-N- 
maleimido-l,8-octane. Derivatizing agents such as methyl-3-[(p- 
azidophenyl)dithio]propioimidate yield photoactivatable intermediates that are 
capable of forming crosslinks in the presence of light. Alternatively, reactive 
water-insoluble matrices such as cyanogen bromide-activated carbohydrates 
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and the reactive substrates described in U.S. Pat. Nos. 3,969,287; 3,691,016; 
4,195,128; 4,247,642; 4,229,537; and 4,330,440 are employed for protein 
immobilization. 

Carbohydrate (oligosaccharide) groups may conveniently be 
5 attached to sites that are known to be glycosylation sites in proteins. 
Generally, O-linked oligosaccharides are attached to serine (Ser) or 
threonine (Thr) residues while N-linked oligosaccharides are attached to 
asparagine (Asn) residues when they are part of the sequence Asn-X- 
Ser/Thr, where X can be any amino acid except proline. X is preferably 

1 o one of the 19 naturally occurring amino acids other than proline. The 

structures of N-linked and O-linked oligosaccharides and the sugar 
residues found in each type are different. One type of sugar that is 
commonly found on both is N-acetylneuraminic acid (referred to as sialic 
acid). Sialic acid is usually the terminal residue of both N-linked and O- 
1 5 linked oligosaccharides and, by virtue of its negative charge, may confer 
acidic properties to the glycosylated compound. Such site(s) may be 
incorporated in the linker of the compounds of this invention and are 
preferably glycosylated by a cell during recombinant production of the 
polypeptide compounds (e.g., in mammalian cells such as CHO, BHK, 

2 0 COS). However, such sites may further be glycosylated by synthetic or 

semi-synthetic procedures known in the art. 

Other possible modifications include hydroxylation of proline and 
lysine, phosphorylation of hydroxyl groups of seryl or threonyl residues, 
oxidation of the sulfur atom in Cys, methylation of the alpha-amino 
2 5 groups of lysine, arginine, and histidine side chains. Creighton, Proteins: 
Structure and Macule Properties (W. H. Freeman & Co., San Francisco), 

pp. 79-86 (1983). ~ 

Compounds of the present invention may be changed at the DNA 
level, as well. The DNA sequence of any portion of the compound may bt 

(,1 
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changed to codons more compatible with the chosen host cell. For Rcoli, 
which is the preferred host cell, optimized codons are known in the art. 
Codons may be substituted to eliminate restriction sites or to include silent 
restriction sites, which may aid in processing of the DN A in the selected 
5 host cell. The vehicle, linker and peptide DNA sequences may be modified 
to include any of the foregoing sequence changes. 
Methods of Making 

The compounds of this invention largely may be made in 
transformed host cells using recombinant DNA techniques. To do so, a 

1 o recombinant DNA molecule coding for the peptide is prepared. Methods 
of preparing such DNA molecules are well known in the art. For instance, 
sequences coding for the peptides could be excised from DNA using 
suitable restriction enzymes. Alternatively, the DNA molecule could be 
synthesized using chemical synthesis techniques, such as the 

1 5 phosphoramidate method. Also, a combination of these techniques could 
be used. 

The invention also includes a vector capable of expressing the 
peptides in an appropriate host. The vector comprises the DNA molecule 
that codes for the peptides operatively linked to appropriate expression 

2 0 control sequences. Methods of effecting this operative linking, either 
before or after the DNA molecule is inserted into the vector, are well 
known. Expression control sequences include promoters, activators, 
enhancers, operators, ribosomal binding sites, start signals, stop signals, 
cap signals, polyadenylation signals, and other signals involved with the 

2 5 control of transcription or translation. 

The resulting vector having the DNA molecule thereon is used to 
transform an appropriate host. This transformation may be performed 
using methods well known in the art. 
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Any of a large number of available and well-known host cells may 
be used in the practice of this invention. The selection of a particular host 
is dependent upon a number of factors recognized by the art. These 
include, for example, compatibility with the chosen expression vector, 
5 toxicity of the peptides encoded by the DNA molecule, rate of 

transformation, ease of recovery of the peptides, expression characteristics, 
bio-safety and costs. A balance of these factors must be struck with the 
understanding that not all hosts may be equally effective for the 
expression of a particular DNA sequence. Within these general guidelines, 

1 0 useful microbial hosts include bacteria (such as E. coli sp.), yeast (such as 
Saccharomvces sp.) and other fungi, insects, plants, mammalian (including 
human) cells in culture, or other hosts known in the art. 

Next, the transformed host is cultured and purified. Host cells may 
be cultured under conventional fermentation conditions so that the 

1 5 desired compounds are expressed. Such fermentation conditions are well 
known in the art. Finally, the peptides are purified from culture by 
methods well known in the art. 

The compounds may also be made by synthetic methods. For 
example, solid phase synthesis techniques may be used. Suitable 

2 0 techniques are well known in the art, and include those described in 
Merrifield (1973), Chem. Polypeptides, pp. 335-61 (Katsoyannis and 
Panayotis eds.); Merrifield (1963), I. Am. Chem. Soc . 85: 2149; Davis etal. 
(1985), Biochem. Intl . 10: 394-414; Stewart and Young (1969), Solid Phase 
Peptide Synthesis; U.S. Pat. No. 3,941,763; Finn etal. (1976), The Proteins 

2 5 (3rd ed.) 2: 105-253; and Erickson etal. (1976), The Proteins (3rd ed.) 2: 
257-527. Solid phase synthesis is the preferred technique of making 
individual peptides since it is the most cost-effective method of making 
small peptides. 
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Compounds that contain derivatized peptides or which contain 
non-peptide groups may be synthesized by well-known organic chemistry 
techniques. 
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Uses of the Compounds 

In general . The compounds of this invention have pharmacologic 
activity resulting from their ability to bind to proteins of interest as 
agonists, mimetics or antagonists of the native ligands of such proteins of 
5 interest. The utility of specific compounds is shown in Table 2. The activity 
of these compounds can be measured by assays known in the art. For the 
TPO-mimetic and EPOmimetic compounds, in vivo assays are further 
described in the Examples section herein. 

In addition to therapeutic uses, the compounds of the present 

1 0 invention are useful in diagnosing diseases characterized by dysfunction 
of their associated protein of interest. In one embodiment, a method of 
detecting in a biological sample a protein of interest (e.g., a receptor) that 
is capable of being activated comprising the steps of: (a) contacting the 
sample with a compound of this invention; and (b) detecting activation of 

1 5 the protein of interest by the compound. The biological samples include 
tissue specimens, intact cells, or extracts thereof. The compounds of this 
invention may be used as part of a diagnostic kit to detect the presence of 
their associated proteins of interest in a biological sample. Such kits 
employ the compounds of the invention having an attached label to allow 

2 0 for detection. The compounds are useful for identifying normal or 
abnormal proteins of interest. For the EPO-mimetic compounds, for 
example, presence of abnormal protein of interest in a biological sample 
may be indicative of such disorders as Diamond Blackfan anemia, where it 
is believed that the EPO receptor is dysfunctional. 

2 5 Therapeutic uses of EPO-mimetic compounds . The EPO-mimetic 

compounds of the invention are useful for treating disorders characterized 
by low red blood cell levels. Included in the invention are methods of 
modulating the endogenous activity of an EPO receptor in a mammal, 
preferably methods of increasing the activity of an EPO receptor. In 

11 
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general, any condition treatable by erythropoietin, such as anemia, may 
also be treated by the EPO-mimetic compounds of the invention. These 
compounds are administered by an amount and route of delivery that is 
appropriate for the nature and severity of the condition being treated and 
5 may be ascertained by one skilled in the art. Preferably, administration is 
by injection, either subcutaneous, intramuscular, or intravenous. 

Therapeutic uses of TPO-mimetic co mpounds. FortheTPO 
mimetic compounds, one can utilize such standard assays as those 
described in W095/26746 entitled "Compositions and Methods for 
1 o Stimulating Megakaryocyte Growth and Differentiation". In vivo assays 
also appear in the Examples hereinafter. 

The conditions to be treated are generally those that involve an 
existing megakaryocyte/ platelet deficiency or an expected 
megakaryocyte/platelet deficiency (e.g., because of planned surgery or 
1 5 platelet donation). Such conditions will usually be the result of a 

deficiency (temporary or permanent) of active Mpl ligand in vivo . The 
generic term for platelet deficiency is thrombocytopenia, and hence the 
methods and compositions of the present invention are generally available 
for treating thrombocytopenia in patients in need thereof. 
2 0 Thrombocytopenia (platelet deficiencies) may be present for 

various reasons, including chemotherapy and other therapy with a variety 
of drugs, radiation therapy, surgery, accidental blood loss, and other 
specific disease conditions. Exemplary specific disease conditions that 
involve thrombocytopenia and may be treated in accordance with this 
2 5 invention are: aplastic anemia, idiopathic thrombocytopenia, metastatic 
tumors which result in thrombocytopenia, systemic lupus erythematosus, 
splenomegaly, Fanconi's syndrome, vitamin B12 deficiency, folic acid 
deficiency, May-Hegglin anomaly, Wiskott-Aldrich syndrome, and 
paroxysmal nocturnal hemoglobinuria. Also, certain treatments for AIDS 
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result in thrombocytopenia (e.g., AZT). Certain wound healing disorders 
might also benefit from an increase in platelet numbers. 

With regard to anticipated platelet deficiencies, e.g., due to future 
surgery, a compound of the present invention could be administered 
5 several days to several hours prior to the need for platelets. With regard 
to acute situations, e.g., accidental and massive blood loss, a compound of 
this invention could be administered along with blood or purified 
platelets. 

The TPO-mimetic compounds of this invention may also be useful in 

1 o stimulating certain cell types other than megakaryocytes if such cells are found 

to express Mpl receptor. Conditions associated with such cells that express the 
Mpl receptor, which are responsive to stimulation by the Mpl ligand, are also 
within the scope of this invention. 

The TPO-mimetic compounds of this invention may be used in any 
1 5 situation in which production of platelets or platelet precursor cells is desired, 
or in which stimulation of the c-Mpl receptor is desired. Thus, for example, the 
compounds of this invention may be used to treat any condition in a mammal 
wherein there is a need of platelets, megakaryocytes, and the like. Such 
conditions are described in detail in the following exemplary sources: 

2 0 W095 /26746; W095/21919; W095/ 18858; WO95/21920 and are incorporated 

herein. 

The TPO-mimetic compounds of this invention may also be useful in 
maintaining the viability or storage life of platelets and/or megakaryocytes and 
related cells. Accordingly, it could be useful to include an effective amount of 
2 5 one or more such compounds in a composition containing such cells. 

The therapeutic methods, compositions and compounds of the 
present invention may also be employed, alone or in combination with 
other cytokines, soluble Mpl receptor, hematopoietic factors, interleukins, 
growth factors or antibodies in the treatment of disease states 



WO 00/24782 



PCTAJS99/25044 



characterized by other symptoms as well as platelet deficiencies. It is 
anticipated that the inventive compound will prove useful in treating 
some forms of thrombocytopenia in combination with general stimulators 
of hematopoiesis, such as IL-3 or GM-CSF. Other megakaryocyte 
5 stimulatory factors, i.e., meg-CSF, stem ceU factor (SCF), leukemia 
inhibitory factor (LIF), oncostatin M (OSM), or other molecules with 
megakaryocyte stimulating activity may also be employed with Mpl 
ligand. Additional exemplary cytokines or hematopoietic factors for such 
co-administration include IL-1 alpha, IL-1 beta, IL-2, IL-3, IL-4, IL-5, IL-6, 

1 o IL-11, colony stimulating factor-1 (CSF-1), SCF, GM-CSF, granulocyte 

colony stimulating factor (G-CSF), EPO, interferon-alpha (IFN-alpha), 
consensus interferon, IFN-beta, or IFN-gamma. It may further be useful to 
administer, either simultaneously or sequentially, an effective amount of a 
soluble mammalian Mpl receptor, which appears to have an effect of 
1 5 causing megakaryocytes to fragment into platelets once the 

megakaryocytes have reached mature form. Thus, administration of an 
inventive compound (to enhance the number of mature megakaryocytes) 
followed by administration of the soluble Mpl receptor (to inactivate the 
ligand and allow the mature megakaryocytes to produce platelets) is 

2 0 expected to be a particularly effective means of stimulating platelet 

production. The dosage recited above would be adjusted to compensate 
for such additional components in the therapeutic composition. Progress 
of the treated patient can be monitored by conventional methods. 

In cases where the inventive compounds are added to compositions 
25 of platelets and/or megakaryocytes and related cells, the amount to be 
included will generally be ascertained experimentally by techniques and 
assays known in the art. An exemplary range of amounts is-0.1 ug— 1 mg 
inventive compound per 10* cells. 
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Pharmaceutical Compositions 

In General . The present invention also provides methods of using 
pharmaceutical compositions of the inventive compounds. Such 
pharmaceutical compositions may be for administration for injection, or for 
5 oral, pulmonary, nasal, transdermal or other forms of administration. In 

general, the invention encompasses pharmaceutical compositions comprising 
effective amounts of a compound of the invention together with 
pharmaceutically acceptable diluents, preservatives, solubilizers, emulsifiers, 
adjuvants and /or carriers. Such compositions include diluents of various 

1 o buffer content (e.g., Tris-HCl, acetate, phosphate), pH and ionic strength; 

additives such as detergents and solubilizing agents (e.g., Tween 80, 
Polysorbate 80), antioxidants (e.g., ascorbic acid, sodium metabisulfite), 
preservatives (e.g., Thimersol, benzyl alcohol) and bulking substances (e.g., 
lactose, mannitol); incorporation of the material into particulate preparations of 
1 5 polymeric compounds such as polylactic acid, polyglycolic acid, etc. or into 
liposomes. Hyaluronic acid may also be used, and this may have the effect of 
promoting sustained duration in the circulation. Such compositions may 
influence the physical state, stability, rate of in vivo release, and rate of in vivo 
clearance of the present proteins and derivatives. See, e.g., Remington's 

2 0 Pharmaceutical Sciences, 18th Ed. (1990, Mack Publishing Co., Easton, PA 

18042) pages 1435-1712 which are herein incorporated by reference. The 
compositions may be prepared in liquid form, or may be in dried powder, such 
as lyophilized form. Implantable sustained release formulations are also 
contemplated, as are transdermal formulations. 
2 5 Oral dosage forms . Contemplated for use herein are oral solid 

dosage forms, which are described generally in Chapter 89 of Remington s 
PharmarPiiHral Sciences (1990), 18th Ed., Mack Publishing Co. Easton PA 
18042, which is herein incorporated by reference. Solid dosage forms 
include tablets, capsules, pills, troches or lozenges, cachets or pellets. Also, 
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liposomal or proteinoid encapsulation may be used to formulate the 
present compositions (as, for example, proteinoid microspheres reported 
in U.S. Patent No. 4,925,673). Liposomal encapsulation may be used and 
the liposomes may be derivatized with various polymers (e.g., US. Patent 
5 No. 5,013,556). A description of possible solid dosage forms for the 

therapeutic is given in Chapter 10 of Marshall, K., Modern Pharmaceutics 
(1979), edited by G. S. Banker and C. T. Rhodes, herein incorporated by 
reference. In general, the formulation will include the inventive 
compound, and inert ingredients which allow for protection against the 

1 o stomach environment, and release of the biologically active material in the 

intestine. 

Also specifically contemplated are oral dosage forms of the above 
inventive compounds. If necessary, the compounds may be chemically 
modified so that oral delivery is efficacious. Generally, the chemical 
1 5 modification contemplated is the attachment of at least one moiety to the 
compound molecule itself, where said moiety permits (a) inhibition of 
proteolysis; and (b) uptake into the blood stream from the stomach or 
intestine. Also desired is the increase in overall stability of the compound 
and increase in circulation time in the body. Moieties useful as covalently 

2 0 attached vehicles in this invention may also be used for this purpose. 

Examples of such moieties include: PEG, copolymers of ethylene glycol 
and propylene glycol, carboxymethyl cellulose, dextran, polyvinyl alcohol, 
polyvinyl pyrrolidone and polyproline. See, for example, Abuchowski and 
Davis, Soluble Polvippr-F.nzvme A bducts. Enzymes as Drugs (1981), 
2 5 Hocenberg and Roberts, eds., Wiley-Interscience, New York, NY, , pp 367- 
83; Newmark, etal. (1982), T Appl. Biochem. 4:185-9. Other polymers that 
could be used are poly-l,3-dioxolane and poly-l,3,6-tioxocane. Preferred 
for pharmaceutical usage, as indicated above, are PEG moieties. 



It 
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For oral delivery dosage forms, it is also possible to use a salt of a 
modified aliphatic amino acid, such as sodium N-(8-[2-hydroxybenzoyl] 
amino) caprylate (SNAC), as a carrier to enhance absorption of the 
therapeutic compounds of this invention. The clinical efficacy of a heparin 
5 formulation using SNAC has been demonstrated in a Phase II trial 

conducted by Emisphere Technologies. See US Patent No. 5,792,451, "Oral 
drug delivery composition and methods". 

The compounds of this invention can be included in the 
formulation as fine multiparticulates in the form of granules or pellets of 
10 particle size about 1 mm. The formulation of the material for capsule 
administration could also be as a powder, lightly compressed plugs or 
even as tablets. The therapeutic could be prepared by compression. 

Colorants and flavoring agents may all be included. For example, 
the protein (or derivative) may be formulated (such as by liposome or 
1 5 microsphere encapsulation) and then further contained within an edible 
product, such as a refrigerated beverage containing colorants and 

flavoring agents. 

One may dilute or increase the volume of the compound of the 
invention with an inert material. These diluents could include 
20 carbohydrates, especially mannitol, a-lactose, anhydrous lactose, cellulose, 
sucrose, modified dextrans and starch. Certain inorganic salts may also be 
used as fillers including calcium triphosphate, magnesium carbonate and 
sodium chloride. Some commercially available diluents are Fast-Flo, 
Emdex, STA-Rx 1500, Emcompress and Avicell. 
2 5 Disintegrants may be included in the formulation of the therapeutic 

into a solid dosage form. Materials used as disintegrants include but are 
not limited to starch including the commercial disintegrant based on 
starch, Explotab. Sodium starch glycolate, Amberlite, sodium 
carboxymethylcellulose, ultramylopectin, sodium alginate, gelatin, orange 

79 
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10 



peel acid carboxymethyl cellulose, natural sponge and bentonite may all 
be used. Another form of the disintegrants are the insoluble cationic 
exchange resins. Powdered gums may be used as disintegrants and as 
binders and these can include powdered gums such as agar, Karaya or 
tragacanth. Alginic acid and its sodium salt are also useful as 
disintegrants. 

Binders may be used to hold the therapeutic agent together to form 
a hard tablet and include materials from natural products such as acacia, 
tragacanth, starch and gelatin. Others include methyl cellulose (MC), ethyl 
cellulose (EC) and carboxymethyl cellulose (CMC). Polyvinyl pyrrolidone 
(PVP) and hydroxypropylmethyl cellulose (HPMC) could both be used in 
alcoholic solutions to granulate the therapeutic. 

An antifrictional agent may be included in the formulation of the 
therapeutic to prevent sticking during the formulation process. Lubricants 
m ay be used as a layer between the therapeutic and the die wall, and these 
can include but are not limited to; stearic acid including its magnesium 
and calcium salts, polytetrafluoroethylene (PTFE), liquid paraffin, 
vegetable oils and waxes. Soluble lubricants may also be used such as 
sodium lauryl sulfate, magnesium lauryl sulfate, polyethylene glycol of 
various molecular weights, Carbowax 4000 and 6000. 

GUdants that might improve the flow properties of the drug dunng 
formulation and to aid rearrangement during compression might be 
added. The glidants may include starch, talc, pyrogenic silica and 
hydrated silicoaluminate. 
5 To aid dissolution of the compound of this invention into the 

aqueous environment a surfactant might be added as a wetting agent. 
Surfactants may include anionic detergents such as sodiumiauryl sulfate, 
dioctyl sodium sulf osuccinate and dioctyl sodium sulfonate. Canonic 
detergents might be used and could include benzalkonium chloride or 

80 
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tab*, chloride. The Us. of potential nonionic detergents that 
could be included in the formulation as surface* are lauromacrogo. 400. 
polyoxyl 40 stearate, polyoxyethylene hydrogenated castor oil 10 50 and 
60 glycerol monostearate, polysorbate 40, 60, 65 and 80. sucrose fatty actd 
, ester.medrylceUuloseandcarboxymethylceUu.ose. These surfacten* 
could be present in me formulation of me protein or derivative either 
alone or as a mixture in different ratios. 

Additives may also be included in the formulation to enhance 
uptake of me compound. Additives potentially having mis property are 
„ for instance the fatty acids oleic acid, linoleic acid and linolenic acid. 

Controlled release formulation may be desirable. The compound of 
Ais invention could be incorporated into an inert matrix which penruts 
release by either diffusion or leaching mechanisms e.g., gums. Slowly 
degenerating matrices may also be incorporated into the formulation, e.g., 
« alginates, polysaccharides. Another form of a controlled release of the 

compounds of mis invention is by a method based on the Oros therapeutic 
sys ,em (Alza Corp.), i.e.. the dmg is enclosed in a semipermeable 
membrane which allows water to enter and push drug out through a 
single small opening due to osmotic effecte. Some enteric coatings also 
20 have a delayed release effect. 

Oner coatings may be used for the formulation. These mdude a 
variety of sugars which could be applied in a coating pan. The therapeutic 
ag e„ t could also be given in a film coated table, and the materials used m 
.his instance are divided into 2 groups. The first are ,he nonenteric 
25 materials and include methyl cellulose, ethyl cellulose, hydroxyemy. 
cellulose, memylhydroxy-ethyl cellulose, hydroxypropyl cellulose, 
hydroxypropyl-memyl cellulose, sodium carboxy-methyl cellulose, 
Povidone and the polyethylene glycols. The second group cons,s.s of .he 
enteric materials that are commonly esters of phthalic acid. 

i\ 



WO 00/24782 



PCT/US99/25044 



A mix of materials might be used to provide the optimum film 
coating. Film coating may be carried out in a pan coater or in a fluidized 
bed or by compression coating. 

Pulmonary delivery forms . Also contemplated herein is pulmonary 
5 delivery of the present protein (or derivatives thereof). The protein (or 
derivative) is delivered to the lungs of a mammal while inhaling and 
traverses across the lung epithelial lining to the blood stream. (Other 
reports of this include Adjei etal., Pharma. Res . (1990) 7: 565-9; Adjei et al . 
(1990), Internatl. I. Pharmaceutics 63: 135-44 (leuprolide acetate); Braquet 

1 0 etal. (1989), T. Cardiovasc. Pharmacol . 13 (suppl.5): s.143-146 (endothelin- 
1); Hubbard et_al. (1989), Annals Int. Med . 3: 206-12 (al-antitrypsin); Smith 
et al . (1989), T. Clin. Invest 84: 1145-6 (al -proteinase); Oswein et al . (March 
1990), "Aerosolization of Proteins", Proc. Symp. Resp. Drug Delivery PL 
Keystone, Colorado (recombinant human growth hormone); Debs et al . 

1 5 (1988), I. Immunol . 140: 3482-8 (interferon^ and tumor necrosis factor a) 
and Platz etal., U.S. Patent No. 5,284,656 (granulocyte colony stimulating 
factor). 

Contemplated for use in the practice of this invention are a wide 
range of mechanical devices designed for pulmonary delivery of 

2 0 therapeutic products, including but not limited to nebulizers, metered 
dose inhalers, and powder inhalers, all of which are familiar to those 
skilled in the art. Some specific examples of commercially available 
devices suitable for the practice of this invention are the Ultravent 
nebulizer, manufactured by Mallinckrodt, Inc., St. Louis, Missouri; the 

2 5 Acorn II nebulizer, manufactured by Marquest Medical Products, 

Englewood, Colorado; the Ventolin metered dose inhaler, manufactured 
by Glaxo Inc., Research Triangle Park, North Carolina; andlhe Spinhaler 
powder inhaler, manufactured by Fisons Corp., Bedford, Massachusetts. 
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All such devices require the use of formulations suitable for the 
dispensing of the inventive compound. Typically, each formulation is 
specific to the type of device employed and may involve the use of an 
appropriate propellant material, in addition to diluents, adjuvants 
5 and/or carriers useful in therapy. 

The inventive compound should most advantageously be 
prepared in particulate form with an average particle size of less than 10 
jim (or microns), most preferably 0.5 to 5 (im, for most effective delivery 
to the distal lung. 

1 0 Pharmaceutical^ acceptable carriers include carbohydrates such 

as trehalose, mannitol, xylitol, sucrose, lactose, and sorbitol. Other 
ingredients for use in formulations may include DPPC, DOPE, DSPC and 
DOPC. Natural or synthetic surfactants may be used. PEG may be used 
(even apart from its use in derivatizing the protein or analog). Dextrans, 

1 5 such as cyclodextran, may be used. Bile salts and other related enhancers 
may be used. Cellulose and cellulose derivatives may be used. Amino 
acids may be used, such as use in a buffer formulation. 

Also, the use of liposomes, microcapsules or microspheres, 
inclusion complexes, or other types of carriers is contemplated. 

2 0 Formulations suitable for use with a nebulizer, either jet or 

ultrasonic, will typically comprise the inventive compound dissolved in 
water at a concentration of about 0.1 to 25 mg of biologically active protein 
per mL of solution. The formulation may also include a buffer and a 
simple sugar (e.g., for protein stabilization and regulation of osmotic 

2 5 pressure). The nebulizer formulation may also contain a surfactant, to 
reduce or prevent surface induced aggregation of the protein caused by 
atomization of the solution in forming the aerosol. 

Formulations for use with a metered-dose inhaler device will 
generally comprise a finely divided powder containing the inventive 
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compound suspended in a propellant with the aid of a surfactant. The 
propellant may be any conventional material employed for this purpose, 
such as a chlorofluorocarbon, a hydrochlorofluorocarbon, a 
hydrofluorocarbon, or a hydrocarbon, including trichlorofluoromethane, 
5 dichlorodifluoromethane, dichlorotetrafluoroethanol, and 1,1,1,2- 

tetrafluoroethane, or combinations thereof. Suitable surfactants include 
sorbitan trioleate and soya lecithin. Oleic acid may also be useful as a 
surfactant. 

Formulations for dispensing from a powder inhaler device will 

1 0 comprise a finely divided dry powder containing the inventive compound 
and may also include a bulking agent, such as lactose, sorbitol, sucrose, 
mannitol, trehalose, or xylitol in amounts which facilitate dispersal of the 
powder from the device, e.g., 50 to 90% by weight of the formulation. 

Nasal delivery forms . Nasal delivery of the inventive compound is 

1 5 also contemplated. Nasal delivery allows the passage of the protein to the 
blood stream directly after administering the therapeutic product to the 
nose, without the necessity for deposition of the product in the lung. 
Formulations for nasal delivery include those with dextran or 
cyclodextran. Delivery via transport across other mucous membranes is 

2 0 also contemplated. 

Dosages . The dosage regimen involved in a method for treating the 
above-described conditions will be determined by the attending physician, 
considering various factors which modify the action of drugs, e.g. the age, 
condition, body weight, sex and diet of the patient, the severity of any infection, 

2 5 time of administration and other clinical factors. Generally, the daily regimen 
should be in the range of 0.1-1000 micrograms of the inventive compound per 
kilogram of body weight, preferably 0.1-150 micrograms per kilogram. 
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Specific preferred embodiments 

The inventors have determined preferred peptide sequences for 
molecules having many different kinds of activity. The inventors have 
further determined preferred structures of these preferred peptides 
5 combined with preferred linkers and vehicles. Preferred structures for 
these preferred peptides listed in Table 21 below. 



Table 21— Preferred embodiments 



Sequence/structure 


SEQ 
ID 

NO: 


Activity 




337 


TPOmimetic 


IEGPTLRQWUVARA-(G)JEGPTLRQWLAARA-(G),- F 1 


338 


TPO-mimetic 


F'-fG* -IEGPTLROWLAARA 


1032 


l rO-mlmeilC 


IEGPTLRQWLAARA -(G),- F' 


1033 


TPOmimetic 


F-(G) S -GGTYSCHFGPLTWVCKPQGG-(G) 4 - 
GGTYSCHFGPLTWVCKPQGG 


339 


EPO-mimetic 


GGTYSCHFGPLTWVCKPQGG-(G),- 
GGTYSCHFGPLTWVCKPQGG-(G),-F' 


340 


EPOmimetic 


GGTYSCHFGPLTWVCKPGGG-(G) 5 -F 1 


1034 


EPO-mimetic 


F'-(G),-DFLPHYKNTSLGHRP 


1045 


TNF-a inhibitor 


DFLPHYKNTSLGHRP-(G) S -F' 


1046 


TNF-a inhibitor 


F'-(G),- FEWTPGYWQPYALPL 


1047 


IL-1 R antagonist 


FEWTPGYWQPYALPL-(G),-F' 


1048 


IL-1 R antagonist 


F'-(G) 5 -VEPNCDIHVMWEWECFERL 


1049 


VEGF-antagonist 


VEPNCDIHVMWEWECFERL-(G) 5 -F* 


1050 


VEGF-antagonist 


F'-(G) S -CTTHWGFTLC 


1051 


MMP inhibitor 


CTTHWGFTLC-(G).-F' 


1052 


MMP inhibitor 



"F 1 " is an Fc domain as defined previously herein. 



Working examples 
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The compounds described above may be prepared as described 
below. These examples comprise preferred embodiments of the invention 
and are illustrative rather than limiting. 

Example 1 

5 TPO-Mimetics 

The following example uses peptides identified by the numbers 
appearing in Table A hereinafter. 

Preparation of peptide 19 . Peptide 17b (12 mg) and MeO-PEG-SH 
5000 (30 mg, 2 equiv.) were dissolved in 1 ml aqueous buffer (pH 8). The 

1 0 mixture was incubated at RT for about 30 minutes and the reaction was 
checked by analytical HPLC, which showed a > 80% completion of the 
reaction. The pegylated material was isolated by preparative HPLC. 

Preparation of peptide 20 . Peptide 18 (14 mg) and MeO-PEG- 
maleimide (25 mg) were dissolved in about 1.5 ml aqueous buffer (pH 8). 

1 5 The mixture was incubated at RT for about 30 minutes, at which time 
about 70% transformation was complete as monitored with analytical 
HPLC by applying an aliquot of sample to the HPLC column. The 
pegylated material was purified by preparative HPLC. 

Bioactivity assay . The TPO in vitro bioassay is a mitogenic assay 

2 0 utilizing an IL-3 dependent clone of murine 32D cells that have been 

transfected with human mpl receptor. This assay is described in greater 
detail in WO 95/26746. Cells are maintained in MEM medium containing 
10% Fetal Clone II and 1 ng/ml mIL-3. Prior to sample addition, cells are 
prepared by rinsing twice with growth medium lacking mIL-3. An 

2 5 extended twelve point TPO standard curve is prepared, ranging from 33 
to 39 pg/ml. Four dilutions, estimated to fall within the linear portion of 
the standard curve, (100 to 125 pg/ml), are prepared for each sample and 
run in triplicate. A volume of 100 jil of each dilution of sample or 
standard is added to appropriate wells of a 96 well microtiter plate 
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containing 10,000 cells/well. After forty-four hours at 37 °C and 10% C0 2/ 
MTS (a tetrazolium compound which is bioreduced by cells to a formazan) 
is added to each well. Approximately six hours later, the optical density is 
read on a plate reader at 490 nm. A dose response curve (log TPO 
5 concentration vs. O.D.- Background) is generated and linear regression 
analysis of points which fall in the linear portion of the standard curve is 
performed. Concentrations of unknown test samples are determined 
using the resulting linear equation and a correction for the dilution factor. 
TMP tandem repeats with polvglvcine linkers . Our design of 

1 0 sequentially linked TMP repeats was based on the assumption that a 

dimeric form of TMP was required for its effective interaction with c-Mpl 
(the TPO receptor) and that depending on how they were wound up 
against each other in the receptor context, the two TMP molecules could 
be tethered together in the C- to N-terminus configuration in a way that 

15 would not perturb the global dimeric conformation. Clearly, the success 
of the design of tandem linked repeats depends on proper selection of the 
length and composition of the linker that joins the C- and N-termini of the 
two sequentially aligned TMP monomers. Since no structural information 
of the TMP bound to c-Mpl was available, a series of repeated peptides 

2 0 with linkers composed of 0 to 10 and 14 glycine residues (Table A) were 
synthesized. Glycine was chosen because of its simplicity and flexibility, 
based on the rationale that a flexible polyglycine peptide chain might 
allow for the free folding of the two tethered TMP repeats into the 
required conformation, while other amino acid sequences may adopt 

2 5 undesired secondary structures whose rigidity might disrupt the correct 
packing of the repeated peptide in the receptor context. 

The resulting peptides are readily accessible by conventional solid 
phase peptide synthesis methods (Merrifield (1963), T. Amer. Chem. Soc . 
85: 2149) with either Fmoc or t-Boc chemistry. Unlike the synthesis of the 
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C-terminally linked parallel dimer which required the use of an 
orthogonally protected lysine residue as the initial branch point to build 
the two peptide chains in a pseudosymmetrical way (Cwirla et al . (1997), 
Science 276: 1696-9), the synthesis of these tandem repeats was a 
5 straightforward, stepwise assembly of the continuous peptide chains from 
the C- to N-terminus. Since dimerization of TMP had a more dramatic 
effect on the proliferative activity than binding affinity as shown for the C- 
terminal dimer (Cwirla etal. (1997)), the synthetic peptides were tested 
directly for biological activity in a TPO-dependent cell-proliferation assay 

1 0 using an IL-3 dependent clone of murine 32D cells transfected with the 
full-length c-Mpl (Palacios etal.,. Cell 41:727 (1985)). As the test results 
showed, all the polyglycine linked tandem repeats demonstrated >1000 
fold increases in potency as compared to the monomer, and were even 
more potent than the C-terminal dimer in this cell proliferation assay. The 

1 5 absolute activity of the C-terminal dimer in our assay was lower than that 
of the native TPO protein, which is different from the previously reported 
findings in which the C-terminal dimer was found to be as active as the 
natural ligand (Cwirla etal. (1997)). This might be due to differences in 
the conditions used in the two assays. Nevertheless, the difference in 

2 0 activity between tandem (C terminal of first monomer linked to N 

terminal of second monomer) and C-terminal (C terminal of first monomer 
linked to C terminal of second monomer; also referred to as parallel) 
dimers in the same assay clearly demonstrated the superiority of tandem 
repeat strategy over parallel peptide dimerization. It is interesting to note 

2 5 that a wide range of length is tolerated by the linker. The optimal linker 
between tandem peptides with the selected TMP monomers apparently is 
composed of 8 glycines. "7 

Other tandem repeats . Subsequent to this first series of TMP 
tandem repeats, several other molecules were designed either with 
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different linkers or containing modifications within the monomer itself. 
The first of these molecules, peptide 13, has a linker composed of GPNG, a 
sequence known to have a high propensity to form a p-tum-type 
secondary structure. Although still about 100-fold more potent than the 
5 monomer, this peptide was found to be >10-fold less active than the 
equivalent GGGG-linked analog. Thus, introduction of a relatively rigid 
P-turn at the linker region seemed to have caused a slight distortion of the 
optimal agonist conformation in this short linker form. 

The Trp9 in the TMP sequence is a highly conserved residue among 

10 the active peptides isolated from random peptide libraries. There is also a 
highly conserved Trp in the consensus sequences of EPO mimetic peptides 
and this Trp residue was found to be involved in the formation of a 
hydrophobic core between the two EMPs and contributed to hydrophobic 
interactions with the EPO receptor. Livnah e^al. (1996), Science 273: 464- 

15 71). By analogy, the Trp9 residue in TMP might have a similar function in 
dimerization of the peptide ligand, and as an attempt to modulate and 
estimate the effects of noncovalent hydrophobic forces exerted by the two 
indole rings, several analogs were made resulting from mutations at the 
Trp. So in peptide 14, the Trp residue was replaced in each of the two 

2 0 TMP monomers with a Cys, and an intramolecular disulfide bond was 
formed between the two cysteines by oxidation which was envisioned to 
mimic the hydrophobic interactions between the two Trp residues in 
peptide dimerization. Peptide 15 is the reduced form of peptide 14. In 
peptide 16, the two Trp residues were replaced by Ala. As the assay data 

2 5 show, all three analogs were inactive. These data further demonstrated 
that Trp is critical for the activity of the TPO mimetic peptide, not just for 
dimer formation. - 

The next two peptides (peptide 17a, and 18) each contain in their 8- 
amino acid linker a Lys or Cys residue. These two compounds are 
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precursors to the two PEGylated peptides (peptide 19 and 20) in which the 
side chain of the Lys or Cys is modified by a PEG moiety. A PEG moiety 
was introduced at the middle of a relatively long linker, so that the large 
PEG component (5 kDa) is far enough away from the critical binding sites 
5 in the peptide molecule. PEG is a known biocompatible polymer which is 
increasingly used as a covalent modifier to improve the pharmacokinetic 
profiles of peptide- and protein-based therapeutics. 

A modular, solution-based method was devised for convenient 
PEGylation of synthetic or recombinant peptides. The method is based on 

10 the now well established chemoselective ligation strategy which utilizes 
the specific reaction between a pair of mutually reactive functionalities. 
So, for pegylated peptide 19, the lysine side chain was preactivated with a 
bromoacetyl group to give peptide 17b to accommodate reaction with a 
thiol-derivatized PEG. To do that, an orthogonal protecting group, Dde, 

1 5 was employed for the protection of the lysine e-amine. Once the whole 
peptide chain was assembled, the N-terminal amine was reprotected with 
t-Boc. Dde was then removed to allow for the bromoacetylation. This 
strategy gave a high quality crude peptide which was easily purified using 
conventional reverse phase HPLC. Ligation of the peptide with the thiol- 

2 0 modified PEG took place in aqueous buffer at pH 8 and the reaction 
completed within 30 minutes. MALDI-MS analysis of the purified, 
pegylated material revealed a characteristic, bell-shaped spectrum with an 
increment of 44 Da between the adjacent peaks. For PEG-peptide 20, a 
cysteine residue was placed in the linker region and its side chain thiol 

2 5 group would serve as an attachment site for a maleimide-containing PEG. 
Similar conditions were used for the pegylation of this peptide. As the 
assay data revealed, these two pegylated peptides had evefT higher in vitro 
bioactivity as compared to their unpegylated counterparts. 
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Peptide 21 has in its 8-amino acid linker a potential glycosyiation 
motif, NGS. Since our exemplary tandem repeats are made up of natural 
amino acids linked by peptide bonds, expression of such a molecule in an 
appropriate eukaryotic cell system should produce a glycopeptide with 
5 the carbohydrate moiety added on the side chain carboxyamide of Asn. 
Glycosyiation is a common post-translational modification process which 
can have many positive impacts on the biological activity of a given 
protein by increasing its aqueous solubility and in vivo stability. As the 
assay data show, incorporation of this glycosyiation motif into the linker 

1 0 maintained high bioactivity . The synthetic precursor of the potential 
glycopeptide had in effect an activity comparable to that of the -(G) 8 - 
linked analog. Once glycosylated, this peptide is expected to have the 
same order of activity as the pegylated peptides, because of the similar 
chemophysical properties exhibited by a PEG and a carbohydrate moiety. 

1 5 The last peptide is a dimer of a tandem repeat. It was prepared by 

oxidizing peptide 18, which formed an intermolecular disulfide bond 
between the two cysteine residues located at the linker. This peptide was 
designed to address the possibility that TMP was active as a tetramer. The 
assay data showed that this peptide was not more active than an average 

2 0 tandem repeat on an adjusted molar basis, which indirectly supports the 
idea that the active form of TMP is indeed a dimer, otherwise dimerization 
of a tandem repeat would have a further impact on the bioactivity. 

In order to confirm the in vitro data in animals, one pegylated TMP 
tandem repeat (compound 20 in Table A) was delivered subcutaneously to 

2 5 normal mice via osmotic pumps. Time and dose-dependent increases 

were seen in platelet numbers for the duration of treatment. Peak platelet 
levels over 4-fold baseline were seen on day 8. A dose of lCTjag/kg/day of - 
the pegylated TMP repeat produced a similar response to rHuMGDF 
(non-pegylated) at 100 ng/kg/day delivered by the same route. 
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Table A— TPO-mimetic Peptides 



Peptide 
No. 



Compound 



SEQID Relative 
NO: Potency 



TPO 

TMP monomer 
TMP C-C dimer 
TMP-(G)„-TMP: 



1 

2 

3 

4 

5 

6 

7 

8 

9 

10 

11 

12 

13 

14 

15 

16 

17a 

17b 

18 

19 

20 

21 

22 



n = 0 
n = 1 
n = 2 
n = 3 
n = 4 
n = 5 
n = 6 
n = 7 
n = 8 
n = 9 
n = 10 
n = 14 

TMP-GPNG-TMP 

IEGPTLRQCLAARA-GGGGGGGG-IEGPTLRQCLAARA 
I 



(cyclic) 

IEGPTLRQCLAARA-GGGGGGGG- 

lEGPTLRQfiLAARA (linear) 

IEGPTLRQA.LAARA-GGGGGGGG- 

IEGPTLRQALAARA 

TMP-GGGKGGGG-TMP 

TMP-GGGK(BrAc)GGGG-TMP 

TMP-GGGCGGGG-TMP 

TMP-GGGK(PEG)GGGG-TMP 

TMP-GGGC(PEG)GGGG-TMP 

TMP-GGGN*GSGG-TMP 

TMP-GGGCGGGG-TMP 
I 

TMP-GGGCGGGG-TMP 



13 



341 
342 
343 
344 
345 
346 
347 
348 
349 
350 
351 
352 
353 
354 

355 

356 

357 
358 
359 
360 
361 
362 
363- 

363 



++++- 



++++ 



+++ 



ND 



+++++ 
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Discussion . It is well accepted that MGDF acts in a way similar to 
hGH, i.e., one molecule of the protein ligand binds two molecules of the 
receptor for its activation. Wells et al .(1996), Ann. Rev. Biochem . 65: 609- 
34. Now, this interaction is mimicked by the action of a much smaller 
5 peptide, TMP. However, the present studies suggest that this mimicry 
requires the concerted action of two TMP molecules, as covalent 
dimerization of TMP in either a C-C parallel or C-N sequential fashion 
increased the in vitro biological potency of the original monomer by a 
factor of greater than 10 3 . The relatively low biopotency of the monomer is 
1 0 probably due to inefficient formation of the noncovalent dimer. A 

preformed covalent repeat has the ability to eliminate the entropy barrier 
for the formation of a noncovalent dimer which is exclusively driven by 
weak, noncovalent interactions between two molecules of the small, 14- 
residue peptide. 

15 It is intriguing that this tandem repeat approach had a similar effect 

on enhancing bioactivity as the reported C-C dimerization is intriguing. 
These two strategies brought about two very different molecular 
configurations. The C-C dimer is a quasi-symmetrical molecule, while the 
tandem repeats have no such symmetry in their linear structures. Despite 

2 0 this difference in their primary structures, these two types of molecules 
appeared able to fold effectively into a similar biologically active 
conformation and cause the dimerization and activation of c-Mpl. These 
experimental observations provide a number of insights into how the two 
TMP molecules may interact with one another in binding to c-Mpl. First, 

25 the two C-termini of the two bound TMP molecules must be in relatively 
close proximity with each other, as suggested by data on the C-terminal 
dimer. Second, the respective N- and C-termini of the two TMP molecules 
in the receptor complex must also be very closely aligned with each other, 
such that they can be directly tethered together with a single peptide bond 
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to realize the near maximum activity-enhancing effect brought about by 
the tandem repeat strategy. Insertion of one or more (up to 14) glycine 
residues at the junction did not increase (or decrease) significantly the 
activity any further. This may be due to the fact that a flexible polyglycine 
5 peptide chain is able to loop out easily from the junction without causing 
any significant changes in the overall conformation. This flexibility seems 
to provide the freedom of orientation for the TMP peptide chains to fold 
into the required conformation in interacting with the receptor and 
validate it as a site of modification. Indirect evidence supporting this 

1 0 came from the study on peptide 13, in which a much more rigid b-turn- 
forming sequence as the linker apparently forced a deviation of the 
backbone alignment around the linker which might have resulted in a 
slight distortion of the optimal conformation, thus resulting in a moderate 
(10-fold) decrease in activity as compared with the analogous compound 

1 5 with a 4-Gly linker. Third, Trp9 in TMP plays a similar role as Trpl3 in 
EMP, which is involved not only in peptide:peptide interaction for the 
formation of dimers but also is important for contributing hydrophobic 
forces in peptideireceptor interaction. Results obtained with the W to C 
mutant analog, peptide 14, suggest that a covalent disulfide linkage is not 

2 0 sufficient to approximate the hydrophobic interactions provided by the 
Trp pair and that, being a short linkage, it might bring the two TMP 
monomers too close, therefore perturbing the overall conformation of the 
optimal dimeric structure. 



2 5 can provide further understanding on the interaction between TMP and c- 
Mpl. This can be facilitated by making reference to the reported structure 
of the EPO mimetic peptide. Livnah et al . (1996), Science 273:464-75 The 
receptor-bound EMP has a b-hairpin structure with a b-turn formed by the 
highly consensus Gly-Pro-Leu-Thr at the center of its sequence. Instead of 



An analysis of the possible secondary structure of the TMP peptide 
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GPLT, TMP has a highly selected GPTL sequence which is likely to form a 
similar turn. However, this turn-like motif is located near the N-terminal 
part in TMP. Secondary structure prediction using Chau-Fasman method 
suggests that the C-terminal half of the peptide has a tendency to adopt a 
5 helical conformation. Together with the highly conserved Trp at position 
9, this C-terminal helix may contribute to the stabilization of the dimeric 
structure. It is interesting to note that most of our tandem repeats are 
more potent than the C-terminal parallel dimer. Tandem repeats seem to 
give the molecule a better fit conformation than does the C-C parallel 

1 0 dimerization. The seemingly asymmetric feature of a tandem repeat 

might have brought it closer to the natural ligand which, as an asymmetric 
molecule, uses two different sites to bind two identical receptor molecules. 

Introduction of a PEG moiety was envisaged to enhance the in vivo 
activity of the modified peptide by providing it a protection against 

1 5 proteolytic degradation and by slowing down its clearance through renal 
filtration. It was unexpected that pegylation could further increase the m 
vitro bioactivity of a tandem repeated TMP peptide in the cell-based 
proliferation assay. 

Example 2 

20 Fc-TMP fusions 

TMPs (and EMPs as described in Example 3) were expressed in 
either monomelic or dimeric form as either N-terminal or C-terminal 
fusions to the Fc region of human IgGl. In all cases, the expression 
construct utilized the luxPR promoter promoter in the plasmid expression 

25 vector pAMG21. 

Fc-TMP . A DNA sequence coding for the Fc region of human IgGl 
fused in-frame to a monomer of the TPO-mimetic peptide was constructed 
using standard PCR technology. Templates for PCR reactions were the 
pFc- A3 vector and a synthetic TMP gene. The synthetic gene was 
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constructed from the 3 overlapping oligonucleotides (SEQ ID NOS: 364, 
365, and 366, respectively) shown below: 

1842-97 AAA AAA GGA TCC TCG AGA TTA AGC ACG AGC AGC CAG CCA 

CTG ACG CAG AGT CGG ACC 

5 

1842-98 AAA GGT GGA GGT GGT GGT ATC GAA GGT CCG ACT CTG CGT 

1842-99 CAG TGG CTG GCT GCT CGT GCT TAA TCT CGA GGA TCC TTT 

TTT 

10 

These oligonucleotides were annealed to form the duplex encoding an 
amino acid sequence (SEQ ID NOS: 367 and 368, respectively) shown 
below: 

1 5 AAAGGTGGAGGTGGTGGTATCGAAGGTCCGACTCTGCGTCAGTGGCTGGCTGCTCGTGCT 

1 + * + + + + 60 

CCAGGCTGAGACGCAGTCACCGACCGACGAGCACGA 
a KGGGGGIEGPTLRQWLAARA 

20 TAATCTCGAGGATCCTTTTTT 

61 + + - 81 

ATTAGAGCTCCTAGGAAAAAA 
a * 

2 5 This duplex was amplified in a PGR reaction using 1842-98 and 1842-97 as 

the sense and antisense primers. 

The Fc portion of the molecule was generated in a PCR reaction 
with pFc-A3 using the primers shown below (SEQ ID NOS: 369 and 370): 

30 1216-52 AAC ATA AGT ACC TGT AGG ATC G 

1830-51 TTCGATACCA CCACCTCCAC CTTTACCCGG AGACAGGGAG AGGCTCTTCTGC 

The oligonucleotides 1830-51 and 1842-98 contain an overlap of 24 

3 5 nucleotides, allowing the two genes to be fused together in the correct 

reading frame by combining the above PCR products in a third reaction 
using the outside primers, 1216-52 and 1842-97. 

The final PCR gene product (the full length fusion gene) was 
digested with restriction endonucleases Xba l and Bam HI, and then ligated 

4 0 into the vector pAMG21 and transformed into competent E. coli strain 

2596 cells as described for EMP-Fc herein. Clones were screened for the 
ability to produce the recombinant protein product and to possess the 

n 
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gene fusion having the correct nucleotide sequence. A single such clone 
was selected and designated Amgen strain #3728. 

The nucleotide and amino acid sequences (SEQ ID NOS: 5 and 6) of 
the fusion protein are shown in Figure 7. 
5 Fc-TMP-TMP . A DNA sequence coding for the Fc region of human 

IgGl fused in-frame to a dimer of the TPO-mimetic peptide was 
constructed using standard PCR technology. Templates for PCR reactions 
were the pFc-A3 vector and a synthetic TMP-TMP gene. The synthetic 
gene was constructed from the 4 overlapping oligonucleotides (SEQ ID 
1 0 NOS: 371 to 374, respectively) shown below: 

1830-52 AAA GGT GGA GGT GGT GGT ATC GAA GGT CCG 

ACT CTG CGT CAG TGG CTG GCT GCT CGT GCT 

15 1830*53 ACC TCC ACC ACC AGC ACG AGC AGC CAG 

CCA CTG ACG CAG AGT CGG ACC 



20 



1830-54 GGT GGT GGA GGT GGC GGC GGA GGT ATT GAG GGC CCA ACC 

CTT CGC CAA TGG CTT GCA GCA CGC GCA 

1830-55 AAA AAA AGG ATC CTC GAG ATT ATG CGC GTG CTG CAA GCC 

ATT GGC GAA GGG TTG GGC CCT CAA TAC CTC CGC CGC C 



The 4 oligonucleotides were annealed to form the duplex encoding an 

2 5 amino acid sequence (SEQ ID NOS: 375 and 376, respectively) shown 

below: 

AAAGGTGGAGGTGGTGGTATCGAAGGTCCGACTCTGCGTCAGTGGCTGGCTGCTCGTGCT 
1 + + + + + + 60 

3 0 C CAGGC TGAG AC GC AGTC ACC GACCGAC GAGC AC G A 

a KGGGGGI EGPTLRQWLAARA 

GGTGGTGGAGGTGGCGGCGGAGGTATTGAGGGCCCAACCCTTCGCCAATGGCTTGCAGCA 
61 + + + + + + 120 

3 5 CCACCACCTCCACCGCCGCCTCCATAACTCCCGGGTTGGGAAGCGGTTACCGAACGTCGT 

a GGGGGGGGIEGPTLRQWLAA 
CGC GCA 

121 148 

4 0 GC GC GTATT AGAGC TCC TAGGAAAAAAA 

a R A * - 

This duplex was amplified in a PCR reaction using 1830-52 and 1830-55 as 
45 the sense and antisense primers. 

The Fc portion of the molecule was generated in a PCR reaction 
with pFc-A3 using the primers 1216-52 and 1830-51 as described above for 

j 7 
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Fc-TMP. The full length fusion gene was obtained from a third PCR 
reaction using the outside primers 1216-52 and 1830-55. 

The final PCR gene product (the full length fusion gene) was 
digested with restriction endonucleases Xba l and BamHL and then ligated 
5 into the vector pAMG21 and transformed into competent E. coli strain 
2596 cells as described in example 1. Clones were screened for the ability 
to produce the recombinant protein product and to possess the gene 
fusion having the correct nucleotide sequence. A single such clone was 
selected and designated Amgen strain #3727. 

1 0 The nucleotide and amino acid sequences (SEQ ID NOS: 7 and 8) of 

the fusion protein are shown in Figure 8. 

TMP-TMP-Fc. A DNA sequence coding for a tandem repeat of the 
TPO-mimetic peptide fused in-frame to the Fc region of human IgGl was 
constructed using standard PCR technology. Templates for PCR reactions 

1 5 were the EMP-Fc plasmid from strain #3688 (see Example 3) and a 
synthetic gene encoding the TMP dimer. The synthetic gene for the 
tandem repeat was constructed from the 7 overlapping oligonucleotides 
shown below (SEQ ID NOS: 377 to 383, respectively): 



20 


1885 


-52 


TTT 


TTT 


CAT 


ATG 


ATC 


GAA 


GGT 


CCG 


ACT 


CTG 


CGT 


CAG 


TGG 




1885 


-53 


AGC 


ACG 


AGC 


AGC 


CAG 


CCA 


CTG 


ACG 


CAG 


AGT 


CGG 


ACC 


TTC 








GAT 


CAT 


ATG 
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1885 


-54 


CTG 


GCT 


GCT 


CGT 


GCT 


GGT 


GGA 


GGC 


GGT 


GGG 


GAC 


AAA 


ACT 








CAC 


ACA 


























1885 


-55 


CTG 


GCT 


GCT 


CGT 


GCT 


GGC 


GGT 


GGT 


GGC 


GGA 


GGG 


GGT 


GGC 








ATT 


GAG 


GGC 


CCA 




















30 


































1885 


-56 


AAG 


CCA 


TTG 


GCG 


AAG 


GGT 


TGG 


GCC 


CTC 


AAT 


GCC 


ACC 


CCC 








TCC 


GCC 


ACC 


ACC 


GCC 




















1885 


-57 


ACC 


CTT 


CGC 


CAA 


TGG 


CTT 


GCA 


GCA 


CGC 


GCA 


GGG 


GGA 


GGC 


35 






GGT 


GGG 


GAC 


AAA 


ACT 




















1885 


-58 


CCC 


ACC 


GCC 


TCC 


CCC 


TGC 


GCG 


TGC 


TGC 











These oligonucleotides were annealed to form the duplex shown encoding 
40 an amino acid sequence shown below (SEQ ID NOS 384 and 385): 
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10 



15 



20 



TTTTTTCATATGATCGAAGGTCCGACTCTGCGTCAGTGGCTGGCTGCTCGTGCTGGCGGT 

1 + + + + + + 60 

GTATACTAGCTTCCAGGCTGAGACGCAGTCACCGACCGACGAGCACGACCGCCA 
a MXEGPTL'RQWLAARAGG 

GGTGGCGGAGGGGGTGGCATTGAGGGCCCAACCCTTCGCCAATGGCTGGCTGCTCGTGCT 

61 + + + + + + 120 

CCACCGCCTCCCCCACCGTAACTCCCGGGTTGGGAAGCGGTTACCGAACGTCGTGCGCGT 
a GGGGGGI EGPTLRQWLAA RA 

GGTGGAGGCGGTGGGGACAAAACTCTGGCTGCTCGTGCTGGTGGAGGCGGTGGGGACAAA 

121 + + + + + + 180 

CCCCCTCCGCCACCC 

a GGGGGDKTLAARAGGGGGDK 

ACTCACACA 
181 189 

a T H T 



This duplex was amplified in a PCR reaction using 1885-52 and 1885-58 as 
the sense and antisense primers. 

The Fc portion of the molecule was generated in a PCR reaction 
with DNA from the EMP-Fc fusion strain #3688 (see Example 3) using the 

2 5 primers 1885-54 and 1200-54. The full length fusion gene was obtained 

from a third PCR reaction using the outside primers 1885-52 and 1200-54. 

The final PCR gene product (the full length fusion gene) was 
digested with restriction endonucleases Xba l and BamHI, and then ligated 
into the vector pAMG21 and transformed into competent E. coli strain 

3 0 2596 cells as described for Fc-EMP herein. Clones were screened for the 

ability to produce the recombinant protein product and to possess the 
gene fusion having the correct nucleotide sequence. A single such clone 
was selected and designated Amgen strain #3798. 

The nucelotide and amino acid sequences (SEQ ID NOS: 9 and 10) 
35 of the fusion protein are shown in Figure 9. 

TMP-Fc . A DNA sequence coding for a monomer of the TPO- 
mimetic peptide fused in-frame to the Fc region of human IgGl was 
obtained fortuitously in the ligation in TMP-TMP-Fc, presumably due to 
the ability of primer 1885-54 to anneal to 1885-53 as well as to 1885-58. A 

4 0 single clone having the correct nucleotide sequence for the TMP-Fc 

construct was selected and designated Amgen strain #3788. 

H 
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The nucleotide and amino acid sequences (SEQ ID NOS: 11 and 12) 
of the fusion protein are shown in Figure 10. 

Expression in E. coli . Cultures of each of the pAMG21-Fc-fusion 
constructs in E. coli GM221 were grown at 37 °C in Luria Broth medium 
5 containing 50 mg/ml kanamycin. Induction of gene product expression 
from the luxPR promoter was achieved following the addition of the 
synthetic autoinducer N-(3-oxohexanoyl)-DL-homoserine lactone to the 
culture media to a final concentration of 20 ng/ml. Cultures were 
incubated at 37 °C for a further 3 hours. After 3 hours, the bacterial 

1 0 cultures were examined by microscopy for the presence of inclusion 
bodies and were then collected by centrifugation. Refractile inclusion 
bodies were observed in induced cultures indicating that the Fc-fusions 
were most likely produced in the insoluble fraction in E. coli . Cell pellets 
were lysed directly by resuspension in Laemmli sample buffer containing 

1 5 10% b-mercaptoethanol and were analyzed by SDS-PAGE. In each case, an 
intense coomassie-stained band of the appropriate molecular weight was 
observed on an SDS-PAGE gel. 

pAMG21 . The expression plasmid pAMG21 can be derived from 
the Amgen expression vector pCFM1656 (ATCC #69576) which in turn be 

2 0 derived from the Amgen expression vector system described in US Patent 
No. 4,710,473. The pCFM1656 plasmid can be derived from the described 
pCFM836 plasmid (Patent No. 4,710,473) by: 

(a) destroying the two endogenous Ndel restriction sites by end 
filling with T4 polymerase enzyme followed by blunt end 

2 5 ligation; 

(b) replacing the DNA sequence between the unique AatH and Clal 
restriction sites containing the synthetic Pl prontoter with a 

similar fragment obtained from pCFM636 (patent No. 4,710,473) 
containing the PL promoter (see SEQ ID NO: 386 below); and 

m 
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(c) substituting the small DNA sequence between the unique Clal 
and Kpnl restriction sites with the oligonucleotide having the 
sequence of SEQ ID NO: 388. 
SEQ ID NO: 386: 

5 Aatll 

5 ' CTAATTCCGCTCTCACCTACCAAACAATGCCCCCCTGCAAAAAATAAATTCATAT - 

3 ' TGCAGATTAAGGCGAGAGTGGATGGTTTGTTACGGGGGGACGTTTTTTATTTAAGTATA - 

- AAAAAAC ATACAGATAACC ATCTGC GGTGATAAATTATCTCTGGC GGTGTTGAC ATAAA - 
1 0 - TTTTTTGTATGTCTATTGGTAGACGCC ACTATTTAATAGAGACCGCCACAACTGTATTT - 

• TACCACTGGCGGTGATACTGAGCACAT 3 ' 
- ATGGTGACCGCCACTATGACTCGTGTAGC 5 ' 

Cla i 



15 



20 



SEQ ID NO: 387: 

5 ' CGATTTGATTCTAGAAGGAGGAATAACATATGGTTAACGCGTTGGAATTCGGTAC 3 ' 
3 ' TAAACTAAGATCTTCCTCCTTATTGTATACCAATTGCGCAACCTTAAGC 5 ' 

Clal Kpn l 



The expression plasmid pAMG21 can then be derived from pCFM1656 by 
making a series of site-directed base changes by PCR overlapping oligo 
mutagenesis and DNA sequence substitutions. Starting with the B glll site 
(plasmid bp # 180) immediately 5' to the plasmid replication promoter 
2 5 P copB and proceeding toward the plasmid replication genes, the base pair 
changes are as shown in Table B below. 



f oi 
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Table B — Base pair changes resulting in pAMG21 
PAMG21 bo# bp in PCFM1656 bp changed to in PAMG21 



5 


# 204 


T/A 


C/G 




If *+C.Q 


A/T 


WW 




# 509 


G/C 


A/T 




# 617 




insert two G/C 




# C7Q 


u/u 


T/A 


10 


# 980 


T/A 


C/G 




# 994 


G/C 


A/T 




#1004 


A/T 


C/G 




#1007 


C/G 


T/A 




#1028 


A/T 


T/A 


15 


#1047 


C/G 


T/A 




#1178 


G/C 


T/A 




#1466 


G/C 


T/A 


r 


#2028 


G/C 


bp deletion 




#2187 


C/G 


T/A 


20 


#2480 


A/T 


T/A 




# 2499-2502 


AGTG 


GTCA 






TCAC 


CAGT 


25 


#2642 


TCCGAGC 


7 bp deletion 






AGGCTCG 






#3435 


G/C 


A/T 




#3446 


G/C 


A/T 


30 


#3643 


A/T 


T/A 



/0> 
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The DNA sequence between the unique Aat n (position #4364 in 
pCFM1656) and SacII (position #4585 in pCFM1656) restriction sites is 
substituted with the DNA sequence (SEQ ID NO: 23) shown in Figures 
17A and 17B. During the ligation of the sticky ends of this substitution 
5 DNA sequence, the outside Aatn and Sac II sites are destroyed. There are 
unique Aat n and Sac II sites in the substituted DNA. 

GM221 (Amgen #2596 ). The Amgen host strain #2596 is an E.coli K- 
12 strain derived from Amgen strain #393. It has been modified to contain 
both the temperature sensitive lambda repressor cI857s7 in the early ebg 

1 0 region and the lacl 0 repressor in the late ebg region (68 minutes). The 
presence of these two repressor genes allows the use of this host with a 
variety of expression systems, however both of these repressors are 
irrelevant to the expression from luxP R . The untransformed host has no 
antibiotic resistances. 

1 5 The ribosome binding site of the cI857s7 gene has been modified to 

include an enhanced RBS. It has been inserted into the ebg operon 
between nucleotide position 1170 and 1411 as numbered in Genbank 
accession number M64441Gb_Ba with deletion of the intervening ebg 
sequence. The sequence of the insert is shown below with lower case 

2 0 letters representing the ebg sequences flanking the insert shown below 
(SEQ ID NO: 388): 

ttattttcg t GCGGCCGC ACCATTATCACCGCC AGAGGTAAACTAGTC AAC ACGCACGGTGTTAGATATTTAT 
CCC TTGC GGTGATAGATTGAGC AC ATCG ATTTGATTC TAGAAGGAGGGATAATAT ATGAGC AC AAAAAAGAAA 
CCATTAACACAAGAGCAGCTTGAGGACGCACGTCGCCTTAAAGCAATTTATGAAAAAAAGAAAAATGAACTTG 

2 5 GCTTATCCCAGGAATCTGTCGCAGACAAGATGGGGATGGGGCAGTCAGGCG 

CAATGCATTAAATGCTTATAACGCCGCATTGCTTACAAAAATTCTCAAAGTTAGCGTTGAAGAATTTAGCCCT 
TCAATCGCCAGAGAATCTACGAGATGTATGAAGCGGTTAGTATGCAGCCGTCACTTAGAAGTGAGTATGAGTA 

AGATGGGT AAGC AC AAC C AAAAAAGCCAGTGATTCTGC ATTC TGGC TTGAGGTTGAAGGT AATTCC ATG AC C G 
30 CACCAAC AGGCTCC AAGCC AAGC TTTCC TGACGGAATGTTAATTCTCGTTGACCCTGAGCAGGCTGTTGAGCC 
AGGTGATTTC TGC ATAGCC AGACTTGGGGGTGATGAGTTTACCTTC AAGAAACTGATC AGGGATAGC GGTC AG 
GTGTTTTTACAACCACTAAACCCACAGTACCCAATGATCCCATGCAATGAGAGTTGTTCCGTTGTGGGGAAAG 
TTATCGCTAGTCAGTGGCCTGAAGAGACGTTTGGCTGATAGACTAGTGGATCCACTAGTg 1 1 1 C tgccc 

3 5 The construct was delivered to the chromosome using a 

recombinant phage called MMebg-cI857s7enhanced RBS #4 into F'tet/393. 
After recombination and resolution only the chromosomal insert described 

101 
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above remains in the cell. It was renamed Ftet/GMIOI. F'tet/GMlOl was 
then modified by the delivery of a lacl° construct into the ebg operon 
between nucleotide position 2493 and 2937 as numbered in the Genbank 
accession number M64441Gb_Ba with the deletion of the intervening ebg 
5 sequence. The sequence of the insert is shown below with the lower case 
letters representing the ebg sequences flanking the insert (SEQ ID NO: 
389) shown below: 

ggcggaaaccGACGTCCATCGAATGGTGCAAAACCTTTCGCGGTATGGCATGATAGCGCCCGGAAGAGAGTCA 
ATTCAGGGTGGTGAATGTGAAACCAGTAACGTTATACGATGTCGCAGAGTATGCCGGTGTCTCTTATCAGACC 

10 GTTTCCCGCGTGGTGAACCAGGCCAGCCACGTTTCTGCGAAAACGCGGGAAAAAGTCGAAGCGGCGATGGCGG 
AGCTGAATTACATTCCCAACCGCGTGGCACAACAACTGGCGGGCAAACAGTCGCTCCTGATTGGCGTTGCCAC 
CTCCAGTCTGGCCCTGCACGCGCCGTCGCAAATTGTCGCGGCGATTAAATCTCGCGCCGATCAACTGGGTGCC 
AGCGTGGTGGTGTCGATGGTAGAACGAAGCGGCGTCGAAGCCTGTAAAGCGGCGGTGCACAATCTTCTCGCGC 
AACGCGTCAGTGGGCTGATCATTAACTATCCGCTGGATGACCAGGATGCCATTGCTGTGGAAGCTGCCTGCAC 

15 TAATGTTCCGGCGTTATTTCTTGATGTCTCTGACCAGACACCCATCAACAGTATTATTTTCTCCCATGAAGAC 
GGTACGCGACTGGGCGTGGAGCATCTGGTCGCATTGGGTCACCAGCAAATCGCGCTGTTAGCGGGCCCATTAA 
GTTCTGTCTCGGCGCGTCTGCGTCTGGCTGGCTGGCATAAATATCTCACTCGCAATCAAATTCAGCCGATAGC 
GGAACGGGAAGGCGACTGGAGTGCCATGTCCGGTTTTCAACAAACCATGCAAATGCTGAATGAGGGCATCGTT 
CCCACTGCGATGCTGGTTGCCAACGATCAGATGGCGCTGGGCGCAATGCGCGCCATTACCGAGTCCGGGCTGC 

20 GCGTTGGTGCGGATATCTCGGTAGTGGGATACGACGATACCGAAGACAGCTCATGTTATATCCCGCCGTTAAC 
CACCATCAAACAGGATTTTCGCCTGCTGGGGCAAACCAGCGTGGACCGCTTGCTGCAACTCTCTCAGGGCCAG 
GCGGTGAAGGGCAATCAGCTGTTGCCCGTCTCACTGGTGAAAAGAAAAACCACCCTGGCGCCCAATACGCAAA 
CCGCCTCTCCCCGCGCGTTGGCCGATTCATTAATGCAGCTGGCACGACAGGTTTCCCGACTGGAAAGCGGACA 
GTAAGGTACCATAGGATCCaggcacagga 

The construct was delivered to the chromosome using a 
recombinant phage called AGebg-LacIQ#5 into Ftet/GMIOI. After 
recombination and resolution only the chromosomal insert described 
above remains in the cell. It was renamed F'tet/GM221. The F'tet episome 
3 0 was cured from the strain using acridine orange at a concentration of 25 
Hg/ml in LB. The cured strain was identified as tetracyline sensitive and 
was stored as GM221. 



Expression . Cultures of pAMG21-Fc-TMP-TMP in E. coli GM221 in 
3 5 Luria Broth medium containing 50 |ig/ml kanamycin were incubated at 
37°C prior to induction. Induction of Fc-TMP-TMP gene product 
expression from the luxPR promoter was achieved following the addition 
of the synthetic autoinducer N-(3-oxohexanoyl)-DL-homoserine lactone to 
the culture media to a final concentration of 20 ng/ml and cultures were 
40 incubated at 37°C for a further 3 hours. After 3 hours, the bacterial 
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cultures were examined by microscopy for the presence of inclusion 
bodies and were then collected by centrifugation. Refractile inclusion 
bodies were observed in induced cultures indicating that the Fc-TMP-TMP 
was most likely produced in the insoluble fraction in E. coli . Cell pellets 
5 were lysed directly by resuspension in Laemmli sample buffer containing 
10% •-mercaptoethanol and were analyzed by SDS-PAGE. An intense 
Coomassie stained band of approximately 30kDa was observed on an 
SDS-PAGE gel. The expected gene product would be 269 amino acids in 
length and have an expected molecular weight of about 29.5 kDa. 

1 0 Fermentation was also carried out under standard batch conditions at the 
10 L scale, resulting in similar expression levels of the Fc-TMP-TMP to 
those obtained at bench scale. 

Purification of Fc-TMP-TMP . Cells are broken in water (1/10) by 
high pressure homogenization (2 passes at 14,000 PSI) and inclusion 

1 5 bodies are harvested by centrifugation (4200 RPM in J-6B for 1 hour). 

Inclusion bodies are solubilized in 6M guanidine, 50mM Tris, 8mM DTT, 
pH 8.7 for 1 hour at a 1 /10 ratio. The solubilized mixture is diluted 20 
times into 2M urea, 50 mM tris, 160mM arginine, 3mM cysteine, pH 8.5. 
The mixture is stirred overnight in the cold and then concentrated about 

20 10 fold by ultafiltration. It is then diluted 3 fold with lOmM Tris, 1.5M 
urea, pH 9. The pH of this mixture is then adjusted to pH 5 with acetic 
acid. The precipitate is removed by centrifugation and the supernatant is 
loaded onto a SP-Sepharose Fast Flow column equilibrated in 20mM 
NaAc, 100 mM NaCl, pH 5(10mg/ml protein load, room temperature). 

2 5 The protein is eluted off using a 20 column volume gradient in the same 
buffer ranging from lOOmM NaCl to 500mM NaCl. The pool from the 
column is diluted 3 fold and loaded onto a SP-Sepharose HP column in 20 
mM NaAc, 150 mM NaCl, pH 5(10 mg/ml protein load, room 
temperature). The protein is eluted off using a 20 column volume gradient 



WO 00/24782 



PCT/US99/25044 



in the same buffer ranging from 150 mM NaCl to 400 mM NaCl. The peak 
. is pooled and filtered. 

Characterization of Fc-TMP activity . The following is a summary of 
in vivo data in mice with various compounds of this invention. 
5 Mice: Normal female BDF1 approximately 10-12 weeks of age. 

Bleed schedule: Ten mice per group treated on day 0, two groups 
started 4 days apart for a total of 20 mice per group. Five mice bled at each 
time point, mice were bled a minimum of three times a week. Mice were 
anesthetized with isoflurane and a total volume of 140-160 yd of blood was 
1 0 obtained by puncture of the orbital sinus. Blood was counted on a 
Technicon HIE blood analyzer running software for murine blood. 
Parameters measured were white blood cells, red blood cells, hematocrit, 
hemoglobin, platelets, neutrophils. 

Treatments: Mice were either injected subcutaneously for a bolus 
1 5 treatment or implanted with 7-day micro-osmotic pumps for continuous 
delivery. Subcutaneous injections were delivered in a volume of 0.2 ml. 
Osmotic pumps were inserted into a subcutaneous incision made in the 
skin between the scapulae of anesthetized mice. Compounds were diluted 
in PBS with 0.1% BSA. All experiments included one control group, 
2 0 labeled "carrier" that were treated with this diluent only. The 

concentration of the test articles in the pumps was adjusted so that the 
calibrated flow rate from the pumps gave the treatment levels indicated in 
the graphs. 

Compounds: A dose titration of the compound was delivered to 
2 5 mice in 7 day micro-osmotic pumps. Mice were treated with various 

compounds at a single dose of 100 Mg/kg in 7 day osmotic pumps. Some 
of the same compounds were then given to mice as a single bolus injection. 

Activity test results: The results of the activity experiments are 
shown in Figures 11 and 12. In dose response assays using 7-day micro- 
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osmotic pumps, the maximum effect was seen with the compound of SEQ 
ID NO: 18 was at 100 Mg/kg/day; the 10 fig/kg/day dose was about 50% 
maximally active and 1 |ig/kg/day was the lowest dose at which activity 
could be seen in this assay system. The compound at 10 ng/kg/day dose 
5 was about equally active as 100 jig/kg/day unpegylated rHu-MGDF in 
the same experiment. 
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Example 3 
Fc-EMP f Mgipns 

Fc-EMP . A DNA sequence coding for the Fc region of human IgGl 
fused in-frame to a monomer of the EPO-mimetic peptide was constructed 
5 using standard PCR technology. Templates for PCR reactions were a 
vector containing the Fc sequence (pFc-A3, described in International 
application WO 97/23614, published July 3, 1997) and a synthetic gene 
encoding EPO monomer. The synthetic gene for the monomer was 
constructed from the 4 overlapping oligonucleotides (SEQ ID NOS: 390 to 
1 0 393, respectively) shown below: 

1798-2 TAT GAA AGG TGG AGG TGG TGG TGG AGG TAC TTA CTC TTG 
CCA CTT CGG CCC GCT GAC TTG G 

15 1798-3 CGG TTT GCA AAC CCA AGT CAG CGG GCC GAA* GTG GCA AGA 
GTA AGT ACC TCC ACC ACC ACC TCC ACC TTT CAT 



20 



45 



1798-4 GTT TGC AAA CCG CAG GGT GGC GGC GGC GGC GGC GGT GGT 
ACC TAT TCC TGT CAT TTT 

1798-5 CCA GGT CAG CGG GCC AAA ATG ACA GGA ATA GGT ACC ACC 
GCC GCC GCC GCC GCC ACC CTG 



The 4 oligonucleotides were annealed to form the duplex encoding an 

2 5 amino acid sequence (SEQ ID NOS: 394 and 395, respectively) shown 

below: 

TATGAAAGGTGGAGGTGGTGGTGGAGGTACTTACTCTTGCCACTTCGGCCCGCTGACTTG 
1 + + + ♦ + + 60 

3 0 T ACTTTC C AC CTCCAC CACC ACCTCC ATGAATGAG AACGGTG AAGCCGGGCG ACTG AAC 

b MKGGGGGGGTYSCHFGPLTW 

GGTTTGCAAACCGCAGGGTGGCGGCGGCGGCGGCGGTGGTACCTATTCCTGTCATTTT 

3 5 CCAAACGTTTGGCGTCCCACCGCCGCCGCCGCCGCCACCATGGATAAGGACAGTAAAACCGGGCGACTGGACC 
bVCKPQGGGGGGGGTYSCHF 

This duplex was amplified in a PCR reaction using 

40 1798-18 GCA GAA GAG CCT CTC CCT GTC TCC GGG TAA 

AGG TGG AGG TGG TGG TGG AGG TAC TTA 
CTC T 



and 



1798-19 CTA ATT GGA TCC ACG AGA TTA ACC ACC 

CTG CGG TTT GCA A 
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as the sense and antisense primers (SEQ ID NOS: 396 and 397, 
respectively). 

The Fc portion of the molecule was generated in a PCR reaction 
with pFc-A3 using the primers 

5 

1216-52 AAC ATA AGT ACC TGT AGG ATC G 

1798-17 AGA GTA AGT ACC TCC ACC ACC ACC TCC ACC TTT ACC CGG 

AGA CAG GGA GAG GCT CTT CTG C 

10 

which are SEQ ID NOS: 398 and 399, respectively. The oligonucleotides 
1798-17 and 1798-18 contain an overlap of 61 nucleotides, allowing the two 
genes to be fused together in the correct reading frame by combining the 
above PCR products in a third reaction using the outside primers, 1216-52 

15 and 1798-19. 

The final PCR gene product (the full length fusion gene) was 
digested with restriction endonucleases Xba l and Bam HL and then ligated 
into the vector pAMG21 (described below), also digested with Xba l and 
BamHL Ligated DNA was transformed into competent host cells of E. coli 

2 0 strain 2596 (GM221, described herein). Clones were screened for the ability 
to produce the recombinant protein product and to possess the gene 
fusion having the correct nucleotide sequence. A single such clone was 
selected and designated Amgen strain #3718. 

The nucleotide and amino acid sequence of the resulting fusion 

2 5 protein (SEQ ID NOS: 15 and 16) are shown in Figure 13. 

EMP-Fc . A DNA sequence coding for a monomer of the EPO- 
mimetic peptide fused in-frame to the Fc region of human IgGl was 
constructed using standard PCR technology. Templates for PCR reactions 
were the pFC-A3a vector and a synthetic gene encoding EPO monomer. 

3 0 The synthetic gene for the monomer was constructed from the 4 

overlapping oligonucleotides 1798-4 and 1798-5 (above) and 1798-6 and 
1798-7 (SEQ ID NOS: 400 and 401, respectively) shown below: 
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1798-6 GGC CCG CTG ACC TGG GTA TGT AAG CCA CAA GGG GGT GGG 
GGA GGC GGG GGG TAA TCT CGA G 

5 1798-7 GAT CCT CGA GAT TAC CCC CCG CCT CCC CCA CCC CCT TGT 
GGC TTA CAT AC 

The 4 oligonucleotides were annealed to form the duplex encoding an 
amino acid sequence (SEQ ID NOS: 402 and 403, respectively) shown 
10 below: 

GTTTGCAAACCGCAGGGTGGCGGCGGCGGCGGCGGTGGTACCTATTCCTGTCATTTTGGC 

1 + + + + + + 60 

GTCCCACCGCCGCCGCCGCCGCCACCATGGATAAGGACAGTAAAACCG 
15 A VCKPQGGGGGGGGTYSCHPG 

CCGCTGACCTGGGTATGTAAGCCACAAGGGGGTGGGGGAGGCGGGGGGTAATCTCGAG 

61 + + + + + + « 122 

GGCGACTGGACCCATACATTCGGTGTTCCCCCACCCCCTCCGCCCCCCATTAGAGCTCCTAG 
20 A PLTWVCKPQGGGGGGG * 

This duplex was amplified in a PCR reaction using 



25 



30 



35 



1798-21 TTA TTT CAT ATG AAA GGT GGT AAC TAT TCC TGT CAT TTT 

and 

1798-22 TGG ACA TGT GTG AGT TTT GTC CCC CCC GCC TCC CCC ACC 

CCC T 

as the sense and antisense primers (SEQ ID NOS: 404 and 405, 
respectively). 

The Fc portion of the molecule was generated in a PCR reaction 
with pFc-A3 using the primers 

1798-23 AGG GGG TGG GGG AGG CGG GGG GGA CAA AAC TCA CAC ATG 

TCC A 



and 

4 0 1200-54 GTT ATT GCT CAG CGG TGG CA 

which are SEQ ID NOS: 406 and 407, respectively. The oligonucleotides 
1798-22 and 1798-23 contain an overlap of 43 nucleotides, allowing the two 
genes to be fused together in the correct reading frame by combining the 
4 5 above PCR products in a third reaction using the outside primers, 1787-21 
and 1200-54. 

The final PCR gene product (the full length fusion gene) was 
digested with restriction endonucleases Xba l and Bam HI, and then ligated 

HO 
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into the vector pAMG21 and transformed into competent E. coli strain 
2596 cells as described above. Clones were screened for the ability to 
produce the recombinant protein product and to possess the gene fusion 
having the correct nucleotide sequence. A single such clone was selected 
5 and designated Amgen strain #3688. 

The nucleotide and amino acid sequences (SEQ ID NOS: 17 and 18) 
of the resulting fusion protein are shown in Figure 14. 

EMP-EMP-Fc A DNA sequence coding for a dimer of the EPO- 
mimetic peptide fused in-frame to the Fc region of human IgGl was 
1 0 constructed using standard PCR technology. Templates for PCR reactions 
were the EMP-Fc plasmid from strain #3688 above and a synthetic gene 
encoding the EPO dimer. The synthetic gene for the dimer was 
constructed from the 8 overlapping oligonucleotides (SEQ ID NO&408 to 
415, respectively) shown below: 

15 





1869 


-23 


TTT 
TAG 


TTT 
AAG 


ATC 
GAG 


GAT 
GAA 


TTG 
TAA 


ATT 
AAT 


CTA 
ATG 


GAT 


TTG 


AGT 


TTT 


AAC 


TTT 


20 


1869 


-48 


TAA 
AA 


AAG 


TTA 


AAA 


CTC 


AAA 


TCT 


AGA 


ATC 


AAA 


TCG 


ATA 


AAA 




1871 


-72 


GGA 
GTT 


GGT 
TGC 


ACT 
AAA 


TAC 
CCG 


TCT 


TGC 


CAC 


TTC 


GGC 


CCG 


CTG 


ACT 


TGG 


25 


1871 


-73 


AGT 
ATT 


CAG 
TTA 


CGG 
TTC 


GCC 
CTC 


GAA 
CTT 


GTG 
C 


GCA 


AGA 


GTA 


AGT 


ACC 


TCC 


CAT 




1871 


-74 


CAG 


GGT 


GGC 


GGC 


GGC 


GGC 


GGC 


GGT 


GGT 


ACC 


TAT 


TCC 


TGT 


30 






CAT 


TTT 


GGC 


CCG 


CTG ACC 


TGG 
















1871 


-75 


AAA ATG 


ACA 


GGA ATA 


GGT 


ACC 


ACC 


GCC 


GCC 


GCC 


GCC 


GCC 








ACC 


CTG 


CGG 


TTT 


GCA 


AAC 


CCA 














35 


1871 


-78 


GTA 


TGT 


AAG 


CCA 


CAA 


GGG 


GGT 


GGG 


GGA 


GGC 


GGG 


GGG 


GAC 






AAA 


ACT 


CAC 


ACA 


TGT 


CCA 


















1871 


-79 


AGT 
ACA 


TTT 
TAC 


GTC 
CCA 


CCC 
GGT 


CCC 
CAG 


GCC 
CGG 


TCC 
GCC 


CCC 


ACC 


CCC 


TTG 


TGG 


CTT 



4 0 The 8 oligonucleotides were annealed to form the duplex encoding an 
amino acid sequence (SEQ ID NOS: 416 and 417, respectively) shown 
below: 



TTTTTTATCGATTTGATTCTAGATTTGAGTTTTAACTTTTAGAAGGAGGAATAAAATATG 

45 1 + + + + + + 60 

AAAAAAT AGC T AAACT AAG ATC T AAAC TC AAAATTG AAAATC TTC C TC C TT ATTTT AT AC 
a M 
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GGAGGTACTTACTCTTGCCACTTCGGCCCGCTGACTTGGGTTTGCAAACCGCAGGGTGGC 

61 + + + + + + 120 

CCTCCATGAATGAGAACGGTCAAGCCGGGCGACTGAACCCAAACGTTTGGCGTCCCACCG 
5 a GGTYSCHFGPLTWVCKPQGG 

GGCGGCGGCGGCGGTGGTACCTATTCCTGTCATTTTGGCCCGCTGACCTGGGTATGTAAG 

121 + + + + + + 180 

CCGCCGCCGCCGCCACCATGGATAAGGACAGTAAAACCGGGCGACTGGACCCATACATTC 
10 a GGGGGGTYSCHPGPLTWVCK 

CCACAAGGGGGTGGGGGAGGCGGGGGGGACAAAACTCACACATGTCCA 

181 + + + + 228 

GGTGTTCCCCCACCCCCTCCGCCCCCCCTGTTTTGA 
15 a PQGGGGGGGDKTHTCP 

This duplex was amplified in a PCR reaction using 1869-23 and 
1871-79 (shown above) as the sense and antisense primers. 

The Fc portion of the molecule was generated in a PCR reaction 
2 0 with strain 3688 DNA using the primers 1798-23 and 1200-54 (shown 
above). 

The oligonucleotides 1871-79 and 1798-23 contain an overlap of 31 
nucleotides, allowing the two genes to be fused together in the correct 
reading frame by combining the above PCR products in a third reaction 

2 5 using the outside primers, 1869-23 and 1200-54. 

The final PCR gene product (the full length fusion gene) was 
digested with restriction endonucleases Xba l and Bam HI, and then ligated 
into the vector pAMG21 and transformed into competent E. coli strain 
2596 cells as described for Fc-EMP. Clones were screened for ability to 

3 0 produce the recombinant protein product and possession of the gene 

fusion having the correct nucleotide sequence. A single such clone was 
selected and designated Amgen strain #3813. 

The nucleotide and amino acid sequences (SEQ ID NOS: 19 and 20, 
respectively) of the resulting fusion protein are shown in Figure 15. There 
35 is a silent mutation at position 145 (A to G, shown in boldface) such that 
the final construct has a different nucleotide sequence than the 
oligonucleotide 1871-72 from which it was derived. 

Fc-EMP-EMP. A DNA sequence coding for the Fc region of human 
IgGl fused in-frame to a dimer of the EPO-mimetic peptide was 
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constructed using standard PCR technology. Templates for PCR reactions 
were the plasmids from strains 3688 and 3813 above. 

The Fc portion of the molecule was generated in a PCR reaction 
with strain 3688 DNA using the primers 1216-52 and 1798-17 (shown 
5 above). The EMP dimer portion of the molecule was the product of a 

second PCR reaction with strain 3813 DNA using the primers 1798-18 (also 
shown above) and SEQ ID NO: 418, shown below: 

1798-20 CTA ATT GGA TCC TCG AGA TTA ACC CCC TTG TGG CTT ACAT 

10 

The oligonucleotides 1798-17 and 1798-18 contain an overlap of 61 
nucleotides, allowing the two genes to be fused together in the correct 
reading frame by combining the above PCR products in a third reaction 
using the outside primers, 1216-52 and 1798-20. 
1 5 The final PCR gene product (the full length fusion gene) was 

digested with restriction endonucleases Xba l and Bam HL and then ligated 
into the vector pAMG21 and transformed into competent E. coli strain 
2596 cells as described for Fc-EMP. Clones were screened for the ability to 
produce the recombinant protein product and to possess the gene fusion 
2 0 having the correct nucleotide sequence. A single such clone was selected 
and designated Amgen strain #3822. 

The nucleotide and amino acid sequences (SEQ ID NOS: and , 

respectively) of the fusion protein are shown in Figure 16. 

Characterization of Fc-EMP activity . Characterization was carried 

2 5 out in vivo as follows. 

Mice: Normal female BDF1 approximately 10-12 weeks of age. 

Bleed schedule: Ten mice per group treated on day 0, two groups 
started 4 days apart for a total of 20 mice per group. Five mice bled at 
each time point, mice were bled a maximum of three times a week. Mice 

3 0 were anesthetized with isoflurane and a total volume of 140-160 ml of 

blood was obtained by puncture of the orbital sinus. Blood was counted 



r 
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on a Technicon HIE blood analyzer running software for murine blood. 
Parameters measured were WBC, RBC, HCT, HGB, PLT, NEUT, LYMPH. 

Treatments: Mice were either injected subcutaneously for a bolus 
treatment or implanted with 7 day micro-osmotic pumps for continuous 
5 delivery. Subcutaneous injections were delivered in a volume of 0.2 ml. 
Osmotic pumps were inserted into a subcutaneous incision made in the 
skin between the scapulae of anesthetized mice. Compounds were diluted 
in PBS with 0.1% BSA. All experiments included one control group, 
labeled // carrier ,/ that were treated with this diluent only. The 
1 0 concentration of the test articles in the pumps was adjusted so that the 

calibrated flow rate from the pumps gave the treatment levels indicated in 
the graphs. 

Experiments: Various Fc-conjugated EPO mimetic peptides (EMPs) 
were delivered to mice as a single bolus injection at a dose of 100 ^lg/kg. 
1 5 Fc-EMPs were delivered to mice in 7-day micro-osmotic pumps. The 
pumps were not replaced at the end of 7 days. Mice were bled until day 
51 when HGB and HCT returned to baseline levels. 

Example 4 
TNF-a inhibitors 

2 0 Fc-TNF-a inhibitors . A DNA sequence coding for the Fc region of 

human IgGl fused in-frame to a monomer of the TNF-a inhibitory peptide 
was constructed using standard PCR technology. The Fc and 5 glycine 
linker portion of the molecule was generated in a PCR reaction with DNA 
from the Fc-EMP fusion strain #3718 (see Example 3) using the sense 

25 primer 1216-52 and the antisense primer 2295-89 (SEQ ID NOS: 1112 and 
1113 , respectively). The nucleotides encoding the TNF-a inhibitory 
peptide were provided by the PCR primer 2295-89 shown below: 



30 



1216-52 
2295-89 



AAC ATA AGT ACC TGT AGG ATC G 

CCG CGG ATC CAT TAC GGA CGG TGA CCC AGA GAG GTG TTT TTG TAG 
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TGC GGC AGG AAG TCA CCA CCA CCT CCA CCT TTA CCC 

The oligonucleotide 2295-89 overlaps the glycine linker and Fc portion of 
the template by 22 nucleotides, with the PCR resulting in the two genes 
5 being fused together in the correct reading frame. 

The PCR gene product (the full length fusion gene) was digested 
with restriction endonucleases Ndel and Bam HI, and then ligated into the 
vector pAMG21 and transformed into competent E. coli strain 2596 cells as 
described for EMP-Fc herein. Clones were screened for the ability to 

10 produce the recombinant protein product and to possess the gene fusion 
having the correct nucleotide sequence. A single such clone was selected 
and designated Amgen strain #4544. 

The nucleotide and amino acid sequences (SEQ ID NOS: 1055 and 
1056) of the fusion protein are shown in Figures 19A and 19B. 

1 5 TNF-q inhibitor-Fc . A DNA.sequence coding for a TNF-a inhibitory 

peptide fused in-frame to the Fc region of human IgGl was constructed 
using standard PCR technology. The template for the PCR reaction was a 
plasmid containing an unrelated peptide fused via a five glycine linker to 
Fc. The nucleotides encoding the TNF-a inhibitory peptide were 

2 0 provided by the sense PCR primer 2295-88, with primer 1200-54 serving as 
the antisense primer (SEQ ID NOS: 1117 and 407, respectively). The 
primer sequences are shown below: 

2295-88 GAA TAA CAT ATG GAC TTC CTG CCG CAC TAC AAA AAC ACC TCT CTG GGT 

25 CAC CGT CCG GGT GGA GGC GGT GGG GAC AAA ACT 



1200-54 GTT ATT GCT CAG CGG TGG CA 

30 

The oligonucleotide 2295-88 overlaps the glycine linker and Fc portion of 
the template by 24 nucleotides, with the PCR resulting in the two genes 
being fused together in the correct reading frame. 



Iir 
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The PCR gene product (the full length fusion gene) was digested 
with restriction endonucleases Ndel and BamHI, and then ligated into the 
vector pAMG21 and transformed into competent E. coli strain 2596 cells as 
described for EMP-Fc herein. Clones were screened for the ability to 
5 produce the recombinant protein product and to possess the gene fusion 
having the correct nucleotide sequence. A single such clone was selected 
and designated Amgen strain #4543. 

The nucleotide and amino acid sequences (SEQ ID NOS: 1057 and 1058) of 
the fusion protein are shown in Figures 20A and 20B. 

1 0 Expression in E. coli . Cultures of each of the p AMG21 -Fc-fusion 

constructs in E. coli GM221 were grown at 37 °C in Luria Broth medium 
containing 50 mg/ml kanamycin. Induction of gene product expression 
from the luxPR promoter was achieved following the addition of the 
synthetic autoinducer N-(3-oxohexanoyl)-DL-homoserine lactone to the 

1 5 culture media to a final concentration of 20 ng/ml. Cultures were 
incubated at 37 °C for a further 3 hours. After 3 hours, the bacterial 
cultures were examined by microscopy for the presence of inclusion 
bodies and were then collected by centrifugation. Refractile inclusion 
bodies were observed in induced cultures indicating that the Fc-fusions 

2 0 were most likely produced in the insoluble fraction in E. coli . Cell pellets 
were lysed directly by resuspension in Laemmli sample buffer containing 
10% p-mercaptoethanol and were analyzed by SDS-PAGE. In each case, an 
intense coomassie-stained band of the appropriate molecular weight was 
observed on an SDS-PAGE gel. 

25 Purification of Fc-peptide fusion proteins . Cells are broken in water 

(1/10) by high pressure homogenization (2 passes at 14,000 PSI) and 
inclusion bodies are harvested by centrifugation (4200 RPM in J-6B for 1 
hour). Inclusion bodies are solubilized in 6M guanidine, 50mM Tris, 8mM 
DTT, pH 8.7 for 1 hour at a 1/10 ratio. The solubilized mixture is diluted 
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20 times into 2M urea, 50 mM tris, 160mM arginine, 3mM cysteine, pH 8.5. 
The mixture is stirred overnight in the cold and then concentrated about 
10 fold by ultafiltration. It is then diluted 3 fold with lOmM Tris, 1.5M 
urea, pH 9. The pH of this mixture is then adjusted to pH 5 with acetic 
5 acid. The precipitate is removed by centrifugation and the supernatant is 
loaded onto a SP-Sepharose Fast Flow column equilibrated in 20mM 
NaAc, 100 mM NaCl, pH 5 (lOmg/ml protein load, room temperature). 
The protein is eluted from the column using a 20 column volume gradient 
in the same buffer ranging from lOOmM NaCl to 500mM NaCl. The pool 

1 0 from the column is diluted 3 fold and loaded onto a SP-Sepharose HP 

column in 20mM NaAc, 150mM NaCl, pH 5(10mg/ml protein load, room 
temperature). The protein is eluted using a 20 column volume gradient in 
the same buffer ranging from 150mM NaCl to 400mM NaCl. The peak is 
pooled and filtered. 

15 Characterization of activity of Fc-TNF-a inhibitor and TNF-a 

inhibitor -Fc . Binding of these peptide fusion proteins to TNF- a can be 
characterized by BIAcore by methods available to one of ordinary skill in 
the art who is armed with the teachings of the present specification. 

Example 5 

20 IL-1 Antagonists 

Fc-IL-1 antagonist . A DNA sequence coding for the Fc region of 
human IgGl fused in-frame to a monomer of an IL-1 antagonist peptide 
was constructed using standard PCR technology. The Fc and 5 glycine 
linker portion of the molecule was generated in a PCR reaction with DNA 

2 5 from the Fc-EMP fusion strain #3718 (see Example 3) using the sense 

primer 1216-52 and the antisense primer 2269-70 (SEQ ID NOS: 1112 and 
1118, respectively). The nucleotides encoding the IL-1 antagonist peptide 
were provided by the PCR primer 2269-70 shown below: 



WO 00/24782 



PCT/US99/25044 



1216-52 



AAC ATA AGT ACC TGT AGG ATC G 



2269-70 



CCG CGG ATC CAT TAC AGC GGC AGA GCG TAC GGC TGC CAG TAA CCC 
GGG GTC CAT TCG AAA CCA CCA CCT CCA CCT TTA CCC 



5 



The oligonucleotide 2269-70 overlaps the glycine linker and Fc portion of 
the template by 22 nucleotides, with the PCR resulting in the two genes 
being fused together in the correct reading frame. 



with restriction endonucleases Ndel and BamHI, and then ligated into the 
vector pAMG21 and transformed into competent E. coli strain 2596 cells as 
described for EMP-Fc herein. Clones were screened for the ability to 
produce the recombinant protein product and to possess the gene fusion 

1 5 having the correct nucleotide sequence. A single such clone was selected 
and designated Amgen strain #4506. 

The nucleotide and amino acid sequences (SEQ ID NOS: 1059 and 
1060) of the fusion protein are shown in Figures 21 A and 21B. 

IL-1 antagonist-Fc A DNA sequence coding for an IL-1 antagonist 

2 0 peptide fused in-frame to the Fc region of human IgGl was constructed 
using standard PCR technology. The template for the PCR reaction was a 
plasmid containing an unrelated peptide fused via a five glycine linker to 
Fc. The nucleotides encoding the IL-1 antagonist peptide were provided 
by the sense PCR primer 2269-69, with primer 1200-54 serving as the 

2 5 antisense primer (SEQ ID NOS: 1119 and 407, respectively). The primer 
sequences are shown below: 

2269-69 GAA TAA CAT ATG TTC GAA TGG ACC CCG GGT TAC TGG CAG CCG TAC GCT 

30 CTG CCG CTG GGT GGA GGC GGT GGG GAC AAA ACT ' 

1200-54 GTT ATT GCT CAG CGG TGG CA 



10 



The PCR gene product (the full length fusion gene) was digested 
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The oligonucleotide 2269-69 overlaps the glycine linker and Fc portion of 
the template by 24 nucleotides, with the PCR resulting in the two genes 
being fused together in the correct reading frame. 
5 The PCR gene product (the full length fusion gene) was digested 

with restriction endonucleases Ndel and BamHI, and then ligated into the 
vector p AMG21 and transformed into competent E. coli strain 2596 cells as 
described for EMP-Fc herein. Clones were screened for the ability to 
produce the recombinant protein product and to possess the gene fusion 

1 0 having the correct nucleotide sequence. A single such clone was selected 
and designated Amgen strain #4505. 

The nucleotide and amino acid sequences (SEQ ID NOS: 1061 and 
1062) of the fusion protein are shown in Figures 22A and 22B. Expression 
and purification were carried out as in previous examples. 

15 Characterization of Fc-IL-1 antagonist peptide and IL-1 antagonist 

peptide-Fc activity . IL-1 Receptor Binding competition between IL-ip, IL- 
1RA and Fc-conjugated IL-1 peptide sequences was carried out using the 
IGEN system. Reactions contained 0.4 nM biotin-IL-lR + 15 nM IL-l-TAG 
+ 3 uM competitor + 20 ug/ ml streptavidin-conjugate beads, where 

20 competitors were IL-1RA, Fc-IL-1 antagonist, IL-1 antagonist-Fc). 

Competition was assayed over a range of competitor concentrations from 
3 uM to 1.5 pM. The results are shown in Table C below: 



lit 
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Table C — Results from IL-1 Receptor Binding Competition Assay 



IL-lpep-Fc FolL-lpep IL-1ra 

5 Kl 281.5 59.58 1.405 

EC50 530.0 112.2 2.645 

95% Confidence Intervals 

10 EC50 280.2 to 1002 54.75 to 229.8 1.149 to 

6.086 

Kl 148.9 to 532.5 29.08 to 122.1 0.6106 to 

3.233 

15 

Goodness of Fit 

R* 0.9790 0.9687 0.9602 



/>t> 
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Example 6 
VEGF-Antagonists 

Fc-VEGF Antagonist . A DNA sequence coding for the Fc region of 
5 human IgGl fused in-frame to a monomer of the VEGF mimetic peptide 
was constructed using standard PCR technology. The templates for the 
PCR reaction were the pFc-A3 piasmid and a synthetic VEGF mimetic 
peptide gene. The synthetic gene was assembled by annealing the 
following two oligonucleotides primer (SEQ ID NOS: 1120 and 1121, 
10 respectively): 

2293-11 GTT GAA CCG AAC TGT GAC ATC CAT GTT ATG TGG GAA TGG GAA 

TGT TTT GAA CGT CTG 

2293-12 CAG ACG TTC AAA AC A TTC CCA TTC CCA CAT AAC ATG GAT GTC 

15 ACA GTT CGG TTC AAC 

The two oligonucleotides anneal to form the following duplex encoding 
an amino acid sequence shown below (SEQ ID NOS 1122 ): 



20 

GTTGAACCGAACTGTGACATCCATGTTATGTGGGAATGGGAATGTTTTGAACGTCTG 

1 + + + + + 57 

CAACTTGGCTTGACACTGTAGGTACAATACACCCTTACCCTTACAAAACTTGCAGAC 

25 a VEPNCDIHVMWEWECFERL 



This duplex was amplified in a PCR reaction using 2293-05 and 2293-06 as 
the sense and antisense primers (SEQ ID NOS. 1125 and 1126). 
3 0 The Fc portion of the molecule was generated in a PCR reaction 

with the pFc-A3 piasmid using the primers 2293-03 and 2293-04 as the 
sense and antisense primers (SEQ ID NOS. 1123 and 1124 r respectively). 
The full length fusion gene was obtained from a third PCR reaction using 
the outside primers 2293-03 and 2293-06. These primers are shown below: 
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2293-03 ATT TGA TTC TAG AAG GAG GAA TAA CAT ATG GAC AAA ACT CAC 

ACA TGT 

5 2293-04 GTC ACA GTT CGG TTC AAC ACC ACC ACC ACC ACC TTT ACC CGG 

AGA CAG GGA 

2293-05 TCC CTG TCT CCG GGT AAA GGT GGT GGT GGT GGT GTT GAA CCG 

AAC TGT GAC ATC 



10 



2293-06 CCG CGG ATC CTC GAG TTA CAG ACG TTC AAA ACA TTC CCA 



The PCR gene product (the full length fusion gene) was digested 
with restriction endonucleases Nde l and BamHI, and then ligated into the 

1 5 vector p AMG21 and transformed into competent E. coli strain 2596 cells as 
described for EMP-Fc herein. Clones were screened for the ability to 
produce the recombinant protein product and to possess the gene fusion 
having the correct nucleotide sequence. A single such clone was selected 
and designated Amgen strain #4523. 

2 0 The nucleotide and amino acid sequences (SEQ ID NOS: 1063 and 

1064) of the fusion protein are shown in Figures 23A and 23B. 

VEGF antagonist -Fc . A DNA sequence coding for a VEGF mimetic 
peptide fused in-frame to the Fc region of human IgGl was constructed 
using standard PCR technology. The templates for the PCR reaction were 

2 5 the pFc- A3 plasmid and the synthetic VEGF mimetic peptide gene 

described above. The synthetic duplex was amplified in a PCR reaction 
using 2293-07 and 2293-08 as the sense and antisense primers (SEQ ID 
NOS. 1127 and 1128, respectively). 

The Fc portion of the molecule was generated in a PCR reaction 

3 0 with the pFc-A3 plasmid using the primers 2293-09 and 2293-10 as the 

sense and antisense primers (SEQ ID NOS. 1129 and 1130, respectively). 
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The full length fusion gene was obtained from a third PCR reaction using 
the outside primers 2293-07 and 2293-10. These primers are shown below: 

2293-07 ATT TGA TTC TAG AAG GAG GAA TAA CAT ATG GTT GAA CCG AAC 

5 TGT GAC 

2293-08 ACA TGT GTG AGT TTT GTC ACC ACC ACC ACC ACC CAG ACG TTC 

AAA ACA TTC 

10 2293-09 GAA TGT TTT GAA CGT CTG GGT GGT GGT GGT GGT GAC AAA ACT 

CAC ACA TGT 

2293-10 CCG CGG ATC CTC GAG TTA TTT ACC CGG AGA CAG GGA GAG 

The PCR gene product (the full length fusion gene) was digested 
1 5 with restriction endonucleases Ndel and BamHL and then ligated into the 
vector pAMG21 and transformed into competent E. coli strain 2596 cells as 
described for EMP-Fc herein. Clones were screened for the ability to 
produce the recombinant protein product and to possess the gene fusion 
having the correct nucleotide sequence. A single such clone was selected 

2 0 and designated Amgen strain #4524. 

The nucleotide and amino acid sequences (SEQ ID NOS: 1065 and 
1066) of the fusion protein are shown in Figures 24 A and 24B. Expression 
and purification were carried out as in previous examples. 

25 Example 7 

MMP Inhibitors 

Fc-MMP inhibitor . A DNA sequence coding for the Fc region of 
human IgGl fused in-frame to a monomer of an MMP inhibitory peptide 
was constructed using standard PCR technology. The Fc and 5 glycine 

3 0 linker portion of the molecule was generated in a PCR reaction with DNA 

from the Fc-TNF-a inhibitor fusion strain #4544 (see Example 4) using the 
sense primer 1216-52 and the antisense primer 2308-67 (SEQ ID NOS: 1112 
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and 1131, respectively). The nucleotides encoding the MMP inhibitor 
peptide were provided by the PCR primer 2308-67 shown below: 



1216-52 



AAC ATA AGT ACC TGT AGG ATC G 



5 



2308-67 



CCG CGG ATC CAT TAG CAC AGG GTG AAA CCC CAG TGG GTG GTG 
CAA CCA CCA CCT CCA CCT TTA CCC 



The oligonucleotide 2308-67 overlaps the glycine linker and Fc portion of 
1 0 the template by 22 nucleotides, with the PCR resulting in the two genes 

being fused together in the correct reading frame. 

The PCR gene product (the full length fusion gene) was digested 

with restriction endonucleases Ndel and BamHI, and then ligated into the 

vector pAMG21 and transformed into competent E. coli strain 2596 cells as 
1 5 described for EMP-Fc herein. Clones were screened for the ability to 

produce the recombinant protein product and to possess the gene fusion 

having the correct nucleotide sequence. A single such clone was selected 

and designated Amgen strain #4597. 



2 0 1068) of the fusion protein are shown in Figures 25 A and 25B. Expression 
and purification were carried out as in previous examples. 

MMP Inhibitor-Fc . A DNA sequence coding for an MMP inhibitory 
peptide fused in-frame to the Fc region of human IgGl was constructed 
using standard PCR technology. The Fc and 5 glycine linker portion of the 

2 5 molecule was generated in a PCR reaction with DNA from the Fc-TNF-ot 
inhibitor fusion strain #4543 (see Example 4). The nucleotides encoding 
the MMP inhibitory peptide were provided by the sense PCR primer 2308- 
66, with primer 1200-54 serving as the antisense primer (SEQ ID NOS: 
1132 and 407, respectively). The primer sequences are shown below: 



The nucleotide and amino acid sequences (SEQ ID NOS: 1067 and 



30 



2308-66 



GAA TAA CAT ATG TGC ACC ACC CAC TGG GGT TTC ACC CTG TGC 
GGT GGA GGC GGT GGG GAC AAA 



35 



1200-54 



GTT ATT GCT CAG CGG TGG CA 
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The oligonucleotide 2269-69 overlaps the glycine linker and Fc portion of 

the template by 24 nucleotides, with the PCR resulting in the two genes 

being fused together in the correct reading frame. 

5 The PCR gene product (the full length fusion gene) was digested 

with restriction endonucleases Ndel and BamHI, and then ligated into the 

vector pAMG21 and transformed into competent E. coli strain 2596 cells as 

described for EMP-Fc herein. Clones were screened for the ability to 

produce the recombinant protein product and to possess the gene fusion 

1 0 having the correct nucleotide sequence. A single such clone was selected 

and designated Amgen strain #4598. 

The nucleotide and amino acid sequences (SEQ ID NOS: 1069 and 

1070) of the fusion protein are shown in Figures 26A and 26B. 

* * * 

1 5 The invention now being fully described, it will be apparent to one 

of ordinary skill in the art that many changes and modifications can be 
made thereto, without departing from the spirit and scope of the invention 
as set forth herein. 



20 Abbreviations 

Abbreviations used throughout this specification are as defined 
below, unless otherwise defined in specific circumstances. 

Ac acetyl (used to refer to acetylated residues) 

AcBpa acetylated p-benzoyl-L-phenylalanine 
2 5 ADCC antibody-dependent cellular cytotoxicity 

Aib aminoisobutyric acid 

bA beta-alaitine 

Bpa p-benzoyl-L-phenylalanine 

BrAc bromoacetyl (BrCH 2 C(0) 
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BSA Bovine serum albumin 

B2I Benzyl 

Cap Caproic acid 

CTL Cytotoxic T lymphocytes 

5 CTLA4 Cytotoxic T lymphocyte antigen 4 

DARC Duffy blood group antigen receptor 

DCC Dicylcohexylcarbodiimide 

Dde i-(4 / 4-dimethyl-2 / 6-dioxo-cyclohexylidene)ethyl 

EMP Erythropoietin-mimetic peptide 

1 0 ESI-MS Electron spray ionization mass spectrometry 

EPO Erythropoietin 

Fmoc fluorenylmethoxycarbonyl 

G-CSF Granulocyte colony stimulating factor 

GH Growth hormone 

15 HCT hematocrit 

HGB hemoglobin 

hGH Human growth hormone 

HOBt 1-Hydroxybenzotriazole 

HPLC high performance liquid chromatography 

20 IL interleukin 

IL-R interleukin receptor 

IL-1R interleukin-1 receptor 

IL-lra interleukin-1 receptor antagonist 

Lau Laurie acid 

25 LPS lipopolysaccharide 

LYMPH lymphocytes 

MALDI-MS Matrix-assisted laser desorption ionizatioamass 

spectrometry 

Me methyl 

in 
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MeO methoxy 

MHC major histocompatibility complex 

MMP matrix metalloproteinase 

MMPI matrix metalloproteinase inhibitor 

5 1-Nap 1-napthylalanine 

NEUT neutrophils 

NGF nerve growth factor 

Nle norleucine 

NMP N-methyl-2-pyrrolidinone 

1 0 PAGE polyacrylamide gel electrophoresis 

PBS Phosphate-buffered saline 

Pbf 2,2 / 4 / 6 / 7-pendamethyldihydrobenzofuran-5-sulfonyl 

PCR polymerase chain reaction 

Pec pipecolic acid 

15 PEG Polyethylene glycol) 

pGlu pyroglutamic acid 

Pic picolinic acid 

PLT platelets 

pY phosphotyrosine 

20 RBC red blood cells 

RBS ribosome binding site 

RT room temperature (25 °C) 

Sar sarcosine 

SDS sodium dodecyl sulfate 

25 STK serine-threonine kinases 

t-Boc tert-Butoxycarbonyl 

tBu tert-Butyl - _ 

TGF tissue growth factor 

THF thymic humoral factor 

m 
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TK 


tyrosine kinase 


TMP 


Thrombopoietin-mimetic peptide 


TNF 


Tissue necrosis factor 


TPO 


Thrombopoietin 


TRAIL 


TNF-related apoptosis-inducing ligand 


Trt 


trityl 


UK 


urokinase 


UKR 


urokinase receptor 


VEGF 


vascular endothelial cell growth factor 


VIP 


vasoactive intestinal peptide 


WBC 


white blood cells 
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What is claimed is: 

1 . A composition of matter of the formula 

and multimers thereof, wherein: 
5 F 1 is an Fc domain; 

X 1 and X 2 are each independently selected from -(V) c -P\ - 
(LVP'-OA -P 2 , -(^VP^LVP 2 ^ 3 )^ and ALX-p-tiX-p-iV), -P 3 - 
(LVP 4 

P 1 , P 2 , P 3 , and P 4 are each independently sequences of 
1 0 pharmacologically active peptides; 

L\ V, L 3 , and V are each independently linkers; and 
a, b, c, d, e, and f are each independently 0 or 1, provided 
that at least one of a and b is 1. 

2. The composition of matter of Claim 1 of the formulae 
is X'-F 1 

or 

F'-X 2 . 

3. The composition of matter of Claim 1 of the formula 

F , -(L 1 ) C -P t . 

20 4. The composition of matter of Claim 1 of the formula 

FMLVP'-fl-VP 2 - 

5. The composition of matter of Claim 1 wherein F 1 is an IgG Fc 
domain. 

6. The composition of matter of Claim 1 wherein F 1 is an IgGl Fc 
2 5 domain. 

7. The composition of matter of Claim 1 wherein F 1 comprises the 
sequence of SEQ ID NO: 2. 

8. The composition of matter of Claim 1 wherein X 1 and X 2 comprise 
an IL-1 antagonist peptide sequence. 

U9 



The composition of matter of Claim 8 wherein the IL-1 antagonist 
peptide sequence is selected from SEQ ID NOS: 212, 907, 908, 909, 
910, 917, and 979. 

The composition of matter of Claim 8 wherein the IL-1 antagonist 
peptide sequence is selected from SEQ ID NOS: 213 to 271, 671 to 
906, 911 to 916, and 918 to 1023. 

The composition of matter of Claim 8 wherein F 1 comprises the 
sequence of SEQ ID NO: 2. 

The composition of matter of Claim 1 wherein X 1 and X 2 comprise 
an EPO-mimetic peptide sequence. 

The composition of matter of Claim 12 wherein the EPO-mimetic 
peptide sequence is selected from Table 5. 
The composition of matter of Claim 12 wherein F 1 comprises the 
sequence of SEQ ID NO: 2. 

The composition of matter of Claim 12 comprising a sequence 
selected from SEQ ID NOS: 83, 84, 85, 124, 419, 420, 421, and 461. . 
The composition of matter of claim 12 comprising a sequence 
selected from SEQ ID NOS: 339 and 340. 
The composition of matter of Claim 12 comprising a sequence 
selected from SEQ ID NOS: 20 and 22. 

The composition of matter of Claim 3 wherein P 1 is a TPO-mimetic 
peptide sequence. 

The composition of matter of Claim 18 wherein P 1 is a TPO-mimetic 
peptide sequence selected from Table 6. 

The composition of matter of Claim 18 wherein F 1 comprises the 
sequence of SEQ ID NO: 2. 

The composition of matter of Claim 18 having a sequence selected 
from SEQ ID NOS: 6 and 12. 

A DNA encoding a composition of matter of any of Claims 1 to 21. 

tn 



An expression vector comprising the DNA of Claim 22. 

A host cell comprising the expression vector of Claim 23. 

The cell of Claim 24, wherein the cell is an E.coli cell. 

A process for preparing a pharmacologically active compound, 

which comprises 

a) selecting at least one randomized peptide that modulates the 
activity of a protein of interest; and 

b) preparing a pharmacologic agent comprising at least one Fc 
domain covalently linked to at least one amino acid sequence 
of the selected peptide or peptides. 

The process of Claim 26, wherein the peptide is selected in a process 
comprising screening of a phage display library, an E. coli display 
library, a ribosomal library, or a chemical peptide library. 
The process of Claim 26, wherein the preparation of the 
pharmacologic agent is carried out by: 

a) preparing a gene construct comprising a nucleic acid 
sequence encoding the selected peptide and a nucleic acid 
sequence encoding an Fc domain; and 

b) expressing the gene construct. 

The process of Claim 26, wherein the gene construct is expressed in 
an E. coli cell. 

The process of Claim 26, wherein the protein of interest is a cell 
surface receptor. 

The process of Claim 26, wherein the protein of interest has a linear 
epitope. 

The process of Claim 26, wherein the protein of interest is a 
cytokine receptor. 

The process of Claim 26, wherein the peptide is an EPO-mimetic 
peptide. 

Hi 
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34. The process of Claim 26, wherein the peptide is a TPO-mimetic 
peptide. 

35. The process of Claim 26, wherein the peptide is an IL-1 antagonist 
peptide. 

5 36. The process of Claim 26, wherein the peptide is an MMP inhibitor 
peptide or a VEGF antagonist peptide. 

37. The process of Claim 26, wherein the peptide is a TNF-antagonist 
peptide. 

38. The process of Claim 26, wherein the peptide is a CTLA4-mimetic 
10 peptide. 

39. The process of Claim 26, wherein the peptide is selected from 
Tables 4 to 20. 

40. . The process of Claim 26, wherein the selection of the peptide is 

carried out by a process comprising: 
15 a) preparing a gene construct comprising a nucleic acid 

sequence encoding a first selected peptide and a nucleic acid 
sequence encoding an Fc domain; 
b) conducting a polymerase chain reaction using the gene 
construct and mutagenic primers, wherein 
20 i) a first mutagenic primer comprises a nucleic acid 

sequence complementary to a sequence at or near the 
5' end of a coding strand of the gene construct, and 
ii) a second mutagenic primer comprises a nucleic acid 
sequence complementary to the 3' end of the 
2 5 noncoding strand of the gene construct. 

41. The process of Claim 26, wherein the compound is derivatized. 

42. The process of Claim 26, wherein the derivatized compound 
comprises a cyclic portion, a cross-linking site, a non-peptidyl 



WO 00/24782 



PCT/US99/25044 



linkage, an N-terminal replacement, a C-terminal replacement, or a 
modified amino acid moiety. 
43. The process of Claim 26 wherein the Fc domain is an IgG Fc 
domain. 

5 44. The process of Claim 26, wherein the vehicle is an IgGl Fc domain. 

45. The process of Claim 26, wherein the vehicle comprises the 
sequence of SEQ ID NO: 2. 

46. The process of Claim 26, wherein the compound prepared is of the 
formula 

and mul timers thereof,wherein: 
F 1 is an Fc domain; 

X 1 and X 2 are each independently selected from -(V) c -P\ - 
(LX-P-dX -P 2 , -(LX-P^LX-P-aJ)^, and -(L 1 ) c -P , .(L 2 ) d -P 2 -(L\ -P 3 - 
15 (LVP 4 

P\ P 2 , P 3 , and P 4 are each independently sequences of 
pharmacologically active peptides; 

L 1 , L 2 , L 3 , and L 4 are each independently linkers; and 
a, b, c, d, e, and f are each independently 0 or 1, provided 
2 0 that at least one of a and b is 1 . 

47. The process of Claim 46, wherein the compound prepared is of the 
formulae 

X 1 -F 1 

or 

25 r-x 2 . 

48. The process of Claim 46, wherein the compound prepared is of the 
formulae 
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or 

FMLVP'-fl-VP 2 

49. The process of Claim 46, wherein F 1 is an IgG Fc domain. 

50. The process of Claim 46, wherein F 1 is an IgGl Fc domain. 

51. The process of Claim 46, wherein F 1 comprises the sequence of SEQ 
ID NO: 2. 
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ATGGACAAAACTCACACATGTCCACCTTGTCCAGCTCCGGAACTCCTGGGGGGACCGTCA 

TACCTGTTTTGAGTGTGTACAGGTGGAACAGGTCGAGGCCTTGAGGACCCCCCTGGCAGT 

MDKTHTCPPCPAPELLGGPS 

GTCTTCCTCTTCCCCCCAAAACCCAAGGACACCCTCATGATCTCCCGGACCGCTGAGGTC 

CAGAAGGAGAAGGGGGGTTTTGGGTTCCTGTGGGAGTACTAGAGGGCCTGGGGACTCCAG 

VFLPPPKPKDTLMISRTPEV 

ACATGCGTGGTGGTGGACGTGAGCCACGAAGACCCTGAGGTCAAGTTCAACTGGTACGTG 

121 ♦ + + + + + 180 

TGTACGCACCACCACCTGCACTCGGTGCTTCTGGGACTCCAGTTCAAGTTGACCATGCAC 

TCVVVDVSHEDPEVKFNWYV 

GACGGC GTGGAGGTGC ATAATGCC AAGACAAAGCC GCGGGAGGAGCAGTACAAC AGCACG 

181 + + + + + + 240 

CTGCCGCACCTCCACGTATTACGGTTCTGTTTCGGCGCCCTCCTCGTCATGTTGTCGTGC 

DGVEVHNAKTKPREEQYNST 

TACCGTGTGGTCAGCGTCCTCACCGTCCTGCACCAGGACTGGCTGAATGGCAAGGAGTAC 

241 + + ♦ ♦ ♦ + 300 

ATGGCACACCAGTCGCAGGAGTGGCAG^ACGTGGTCCTGACCGACTTACCGTTCCTCATG 

Y. RVVSVLTVLHQDWLNGKEY 

AAGTGCAAGGTCTCCAACAAAGCCCTCCCAGCCCCCATCGAGAAAACCATCTCCAAAGCC 

301 + + + + ♦ ♦ 360 

TTC AC GTTC C AGAGGTTGTTTC GGGAGGGTC GGGGGT AGCTC TTTTGGT AG AGGTTTC GG 

KCKVSNKALPAPIEKTISKA 

AAAGGGCAGCCCCGAGAACCACAGGTGTACACCCTGCCCCCATCCCGGGATGAGCTGACC 

361 + ♦ + ♦ + ♦ 420 

TTTCCCGTCGGGGCTCTTGGTGTCC AC ATGTGGGAC GGGGGT AGGGCCCT AC TCGACTGG 

KGQ P RE PQVYTL P PS RDELT 

AAGAACCAGGTCAGCCTGACCTGCCTGGTCAAAGGCTTCTATCCCAGCGACATCGCCGTG 

421 + * + ♦ + + 480 

TTCTTGGTCC AGTCGGACTGGACGGAC C AGTTTCCGAAGATAGGGTC GCTGTAGCGGC AC 

KNQVSLTCLVKGFYPSDIAV 

GAGTGGGAGAGCAATGGGCAGCCGGAGAACAACTACAAGACCACGCCTCCCGTGCTGGAC 

481 ♦ + + + + + 540 

CTCACCCTCTCGTTACCCGTCGGCCTCTTGTTGATGTTCTGGTGCGGAGGGCACGACCTG 

EWESNGQPENNYKTTPPVLD 

TCCGACGGCTCCTTCTTCCTCTACAGCAAGCTCACCGTGGACAAGAGCAGGTGGCAGCAG 

541 ♦ ♦ + + ♦ + 600 

AGGC TGCCGAGGAAGAAGGAGATGTCGTTC GAGTGGCACCTGTTCTC GTCC ACCGTC GTC 

SDGSFFLYSKLTVDKSRWQQ 

GGGAACGTCTTCTCATGCTCCGTGATGCATGAGGCTCTGCACAACCACTACACGCAGAAG 

601 ♦ ♦ * ♦ ♦ ♦ 660 

CCCTTGCAGAAGAGTACGAGGCACTACGTACTCCGAGACGTGTTGGTGATGTGCGTCTTC 

GNVFSC SVMHEALHNHYTQK 

AGCCTCTCCCTGTCTCCGGGTAAA 

661 + ♦ 684 

TCGGAGAGGGACAGAGGCCCATTT 
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FIG. 7 

I 

TCTAGATTTGTTTTAACTAATTAAAGGAGGAATAACATATGGACAAAACTCACACATGTC 

1 + + ♦ ♦ ♦ > 60 

AGATCTAAACAAAATTGATTAATTTCCTCCTTATTGTATACCTGTTTTGAGTGTGTACAG 

MDKTHTCP- 

CACCTTGTCCAGCTCCGGAACTCCTGGGGGGACCGTCAGTCTTCCTCTTCCCCCCAAAAC 
61 " * ----«. ♦ ♦ + + 120 

GTGGAACAGGTCGAGGCCTTGAGGACCCCCCTGGCAGTCAGAAGGAGAAGGGGGGTTTTG 
PCPAPELLGGPSVFLPPPKP- 

CCAAGGACACCCTCATGATCTCCCGGACCCCTGAGGTCACATGCGTGGTGGTGGACGTGA 
121 ........... + + + 18Q 

GGTTCCTGTGGGAGTACTAGAGGGCCTGGGGACTCCAGTGTACGCACCACCACCTGCACT 
KDTLMISRTPEVTCVVVDVS- 

GCCACGAAGACCCTGAGGTCAAGTTCAACTGGTACGTGGACGGCGTGGAGGTGCATAATG 
181 - - - - - + ♦ ♦ ♦ + «, 24Q 

CGGTGCTTCTGGGACTCCAGTTCAAGTTGACCATGCACCTGCCGCACCTCCACGTATTAC 
HEDPEVKFNWYVDGVEVHNA- 

CCAAGACAAAGCCGCGGGAGGAGCAGTACAACAGCACGTACCGTGTGGTCAGCGTCCTCA 
241 + ♦ ♦ ♦ ♦ ♦ 300 

GGTTCTGTTTCGGCGCCCTCCTCGTCATGTTGTCGTGCATGGCACACCAGTCGCAGGAGT 
KTKPREEQYM3TYRVVSVLT- 

CCGTCCTGCACCAGGACTGGCTGAATGGCAAGGAGTACAAGTGCAAGGTCTCCAACAAAG 

301 ♦ ♦ ♦ ♦ ♦ + 360 

GGCAGGACGTGCTCCTGACCGACTTACCGTTCCTCATGTTCACGTTCCAGAGGTTGTTTC 
V LHQOWLNGKEYKCKVS N K A - 

CCCTCCCAGCCCCCATCGAGAAAACCATCTCCAAAGCCAAAGGGCAGCCCCGAGAACCAC 

361 ♦ ♦ ♦ * 4 4 20 

GGGAGGGTCGGGGGTAGCTCTTTTGGTAGAGGTTTCGCriTTC 

L PAPI EKTI 3KAKGQPRBPQ- 

AGGTGTACACCCTGCCCCCATCCCGGGATGAGCTGACCAAGAACCAGGTCAGCCTGACCT 
421 ♦ ♦ ♦ ♦ + 480 

TCCACATGTGGGACGGGGGTAGGGCCCTACTCGACTGGTTCTTGCTCCAGTCGGACTGGA 
VYTLPPS ROELTKNQVS LTC* 

GCCTGGTCAAAGGCTTCTATCCCAGCGACATCGCCGTGGAGTGGGAGAGCAATGGGCAGC 
481 + ♦ + ♦ ♦ + 540 

CGGACCAGTTTCCGAAGATAGGGTCGCTGTAGCGGCACCTCACCCTCTCGTTACCCGTCG 
LVKGFYPSDIAVBWBSNGQp. 

CGGAGAACAACTACAAGACCACGCCTCCCGTGCTGGACTCCGACGGCTCCTTCTTCCTCT 
541 ♦ ♦ ♦ + + 600 

GCCTCTTGTTGATGTTCTGGTGCGGAGGGCACGACCTGAGGCTGCCGAGGAAGAAGGAGA 
EN NYKTT PPVLDSDG3 F FLY- 

ACAGCAAGCTCACCGTGGACAAGAGCAGGTGGCAGCAGGGGAACGTCTTCTCATGCTCCG 
601 ♦ + + ♦ + + 660 

TGTCGTTCGAGTGGCACCTGTTCTCGTCCACCGTCGTCCCCTTGCAGAAGAGTACGAGGC 
SKLTVDKSRWQQGNVFSCSV- 

TGATGCATGAGGCTCTGCACAACCACTACACGCAGAAGAGCCTCTCCCTGTCTCCGGGTA 
661 + + ♦ ♦ ♦ + 720 

actacgtactccgagacgtgttgqtgatgtgcgtcttctcggagagggacagaggcccat 
mhbalhnhytqkslslsp~gk- 

aaggtggaggtggtggtatcgaaggtccgactctgcgtcagtggctggctgctcgtgctt 

721 ♦ ♦ ♦ ♦ ♦ ♦ 780 

TTCCACCTCCACCACCATAGCTTCCAGGCTGAGACGCAGTCACCGACCGACGAGCACGAA 
GGGGGI EGPTLRQWLAA R A * - 

BaxnHI 
I 

AATCTCGAGGATCC 

781 ♦ 794 

TTAGAGCTCCTAGG 
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Xbal 

I 

TCTAGATTTGTTTTAACTAATTAAAGGAGCAATAACATATGCACAAAACTCACACATGTC 

1 ♦ ♦ * + ♦ ♦ ♦ 60 

AGATCTAAACAAAATTGATTAATTTCCTCCTTATTGTATACCTGTTTTGAGTGTCTACAG 

MDKTHTCP- 

CACCTTGTCCAGCTCCGGAACTCCTGGGGGGACCGTCAGTCTTCCTCTTCCCCCCAAAAC 

61 ♦ ♦ ♦ ♦ ♦ ♦ 120 

GTGGAACAGGTCGAGGCCTTGAGGACCCCCCTGGCACTCAGAAGGACAAGGGGGGTTTTG 
PCPAPELLGGPSVPLFPPKP* 

CCAAGGACACCCTCATGATCTCCCGGACCCCTGAGGTCACATGCGTGGTGGTGGACGTGA 

121 «■ ♦ ♦ ♦ + 180 

GGTTCCTGTGGGAGTACTAGAGGGCCTGGGGACTCCAGTGTACGCACCACCACCTGCACT 
KDTLMI SRTPEVTCVVVDV3- 

GCCACGAAGACCCTGAGGTCAAGTTCAACTGGTACGTGGACGGCGTGGAGGTGCATAATG 

181 ♦ «■ ♦ ♦ ♦ + 240 

CGGTGCTTCTGGGACTCCAGTTCAAGTTGACCATGCACCTGCCGCACCTCCACGTATTAC 
HEDPEVKFNWYVDGVEVHNA- 

CCAAGACAAAGCCGCGGGAGGAGCAGTACAACAGCACGTACCGTGTGGTCAGCGTCCTCA 

241 ♦ ♦ ♦ ♦ ♦ <► 300 

GGTTCTGTTTCGGCGCCCTCCTCGTCATGTTGTCGTGCATGGCACACCAGTCGCAGGAGT 
KTKPREEQYN3TYRVV3VLT- 

CCGTCCTGCACCAGGACTGGCTGAATGGCAAGGAGTACAAGTGCAAGGTCTCCAACAAAG 

301 ♦ ♦ ♦ ♦ ♦ + 360 

GGCAGGACGTCWTCCTGACCGACTTACCGTTCCTCATGTTCACGTTC 

VLHQDWLNGKEYKCKVSNKA- 

CCCTCCCAGCCCCCATCGAGAAAACCATCTCCAAAGCCAAAGGGCAGCCCCGAGAACCAC 

361 ♦ ♦ ♦ «• + 420 

GGGAGGGTCGGGGGTAGCTCTTTTGCTAGAGGTTTCGGTTTCCCGTC 

LPAPIEKTISKAKGQPREPQ- 

AGGTGTACACCCTGCCCCCATCCCGGGATGAGCTGACCAAGAACCAGGTCAGCCTGACCT 

421 ♦ ♦ ♦ * ♦ ♦ 480 

TCCACATGTGGGACGGGGGTAGGGCCCTACTCGACTGGTTCTTGGTCCAGTCGGACTGGA 
VYTLPP3R0ELTKNQVSLTC- 

GCCTGGTCAAAGGCTTCTATCCCAGCGACATCGCCGTGGAGTGGGAGAGCAATGGGCAGC 

481 - ♦ ♦ + ♦ ♦ ♦ 540 

CGGACCAGTTTCCGAAGATAGGGTCGCTGTAGCGGCACCTCACCCTCTCGTTACCCGTCG 
LVKGFYP3DIAVEWE3NGQP- 

CGGAGAACAACTACAAGACCACGCCTCCCGTGCTGGACTCCGACGGCTCCTTCTTCCTCT 

541 ♦ ♦ ♦ + ♦ ♦ 600 

GCCT CT T GT TGATGTTCTGGTGCGGAGGGCACGACCTGAGGCTGCCGAGGAAGAAGCAGA 
EMNYKTTP PVLD 3 DGS P F L Y - 

ACAGCAAGCTCACCGTGGACAAGAGCAGGTGGCAGCAGGGGAACGTCTTCTCATGCTCCG 

601 ♦ ♦ ♦ ♦ ♦ > 660 

TGTCGTTCGAGTGGCACCTGTTCTCGTCCACCGTCGTCCCCTTGCAGAAGAGTACGAGGC 
3 KLTVDKS RWQQGNVFSCSV- 

TGATGCATOAGGCTCTGCACAACCACTACACGCAGAAGAGCCTCTCCCTGTCTCCGGGTA 

661 ♦ ♦ ♦ ♦ ♦ ♦ 720 

ACTACGTACTCCGAGACGTGTTGGTGATGTGCGTCTTCTCGGAGAGGGACAGAGGCCCAT 
MHEALHNHYTQK S L3 L3 PGK- 

AAGGTGGAGGTGGTGGTATCGAAGGTCCGACTCTGCGTCAGTGGCTGGCTGCTCGTGCTG 

721 ♦ ♦ * + ♦ + 780 

TTCCACCTCCACCACCATAGCrfCCAGGCTGAGACGCAGTCACCGACCGACGAGCACGAC 
GG GGG I EG PT L R QWLAA RAG- 

GTGGTGGAGGTGGCGGCGGAGGTATTGAGGGCCCAACCCTTCGCCAATGGCTTGCAGCAC 

781 ♦ + ♦ ♦ ♦ ♦ 840 

CACCACCTCCACCGCCGCCTCCATAACTCCCGGGTTGGGAAGCGGTTACCGAACGTCGTG 
GGGGGGGI EG PT LRQWLAAR- 

BamHI 
I 

GCGCATAATCTCGAGGATCCG 

841 ♦ ♦ • 861 

C GCGT ATT AGAGCTCCT AGGC 

C A * - 
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I 

TCTAGATTTGTTTTAACTAATTAAAGGAGGAATAACATATGATCGAAGCTCCGACTCTGC 

1 • > ♦ ♦ * + ♦ 60 

AGATCTAAACAAAATTGATTAATTTCCTCCTTATTGTATACTAGCTTCCAGGCTGAGACG 

MIBGPTLR- 

GTCAGTGGCTGGCTGCTCGTGCTGGCGGTGGTGGCGGAGGGGGTGGCATTGAGGGCCCAA 

61 ♦ ♦ * ♦ ♦ ♦ 120 

CAGTCACCGACCGACCAGCACGACCGCCACCACCGCCTCCCCCACCGTAACTCCCGGGTT 
QWLAARAGGGGGGGG I EG P T • 

CC CTTCGCCAATGGCTTGC AGC ACGCGC AGGGGGAGGCGGTGGGGAC AAAACTCACACAT 

121 ♦ ♦ ♦ + ♦ * 180 

GGGAAGCGGTTACCGAACGTCGTGCGCGTCCCCCTCCGCCACCCCTGTTTTGAGTGTGTA 
LRQWLAARAGGGGGDRTHTC- 



GTCCACCTTGCCCAGCACCTGAACTCCTGGGGGGACCGTCAGTTTTCCTCTTCCCCCCAA 

181 ♦ + ♦ ♦ + + 240 

CAGGTGGAACGGGTCGTGGACTTGAGGACCCCCCTGGCAGTCAAAAGGAGAAGGGGGGTT 
P PCPAPELLGGP3VPLFPPK* 



AACCCAAGGACACCCTCATGATCTCCCGGACCCCTGAGGTCACATGCGTGGTGGTGGACG 

241 ♦ ♦ ♦ ♦ «■ ♦ 300 

TTGGGTTCCTGTGGGAGTACTAGAGGGCCTGGGGACTCCAGTGTACGCACCACCACCTGC 
PKOTLMX SRTPEVTCVVVDV- 

TGAGCCACGAAGACCCTGAGGTCAAGTTCAACTGGTACGTGGACGGCGTGGAGGTGCATA 

301 ♦ + ♦ ♦ ♦ ♦ 360 

ACTCGGTGCTTCTGGGACTCCAGTTCAACTTGACCATGCACCTGCCGCACCTCCACGTAT 
SHEDPEVKPNWYVDOVBVHN- 

ATGCCAAGACAAAGCCGCGGGAGGAGCAGTACAACAGCACGTACCGTCTGGTCAGCGTCC 
361 ♦ ♦ ♦ ♦ + ♦ 420 

AKTKPRBBQYNSTVRVVSVL- 

TCACCGTCCTGCACCAGGACTGGCTGAATGGCAAGGAGTACAAGTGCAAGGTCTCCAACA 

421 + ♦ ♦ «■ + + 480 

AGTGGCAGGACGTGGTCCTGACCGACTTACCGTTCCTCATGTTCACGTTCCAGAGGTTGT 
TVLHQDWLNGKBYKCKVSNK- 

AAGCCCTCCCAGCCCCCATCGAGAAAACCATCTCCAAAGCCAAAGGGCAGCCCCGACAAC 

481 + ♦ ♦ + + ♦ 540 

TTCGGGAGGGTCGGGGGTAGCTCTTTTGGTAGAGGTTTCGGTTTCCCCT 

ALPAPIEKTI 3KAKGQPR2P* 

CACAGCTGTACACCCTGCC<;CCATCCCGGGATGAGCTGACCAAGAACCAGGTCAGCCTGA 

541 ♦ ♦ ♦ > ♦ ♦ 600 

GTGTCCACATGTGGGACGGGGGTAGGGCCCTACTCGACTGGTTCTTGCTCCAGTCGGACT 
QVYTLPPSRDBLTKNQVS LT* 

CCTGCCTGGTCAAAGGCTTCTATCCCAGCGACATCGCCGTGGAGTGGGAGAGCAATCC3GC 

601 * + ♦ + + ♦ 660 

GGACGGACCAGTTTCCGAAGATAGGGTCGCTGTAGCGGCACGTCACCCTCTCGTTACCCG 
CLVKGFYPSDIAVBWB3MGQ- 

AGCCGGAGAACAACTACAAGACCACGCCTCCCGTGCTGGACTCCGACGGCTCCTTCTTCC 
661 ♦ ♦ * ♦ ♦ ♦ 720 



TCGGCCTCTTGTTGATGTTCTGGTGCGCAGGGCACGACCTGAGGCTGCCGAGGAAGAAGG 
PENNYKTTPPVLD3DGS FPL- 



TCTACAGCAAGCTCACCGTGGACAAGAGCAGGTGGCAGCAGGGGAACGTCTTCTCATGCT 

721 ♦ ♦ * ♦ + + 780 

AGATGTCGTTCGAGTGGCACCTGTTCTCGTCCACCGTCGTCCCCTTGCAGAXGAGTACGA 
Y3KLTVDKSRWQQGNVFSC3- 

CCGTGATGCATGAGGCTCTGCACAACCACTACACGCAGAAGAGCCTCTCCCTGTCTCCGG 

781 ♦ ♦ ♦ ♦ + ♦ 840 

GGCACTACGTACTCCGAGACGTGTTGGTGATGTGCGTCTTCTCGGAGAGGGACAGAGGCC 
VMHEALHNHYTQKSLSL3PG- 

BamHX 

I 

GTAAATAATGGATCC 

841 ♦ 855 

CATTTATTACCTAGG 
K ♦ 
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Xbal 



FIG. 10 



TCTAGATTTGTTTTAACTAATTAAAGGAGGAATAACATATGATCGAAGGTCCGACTCTGC 

1 + ♦ ♦ ♦ + + 60 

AG ATC T AAAC AAAATTGATTAATTTCCTC CTTATTGTAT ACTAGCTTCC AGGCTGAGACG 

MIEGPTLR- 

GTCAGTGGCTGGCTGCTCGTGCTGGTGGAGGCGGTGGGGACAAAACTCACACATGTCCAC 

61 ♦ ♦ ♦ ♦ ♦ ♦ 120 

CAGTCACCGACCGACGAGCACGACCACCTCCGCCACCCCTGTTTTGAGTGTGTACAGGTG 
QW'LAARAGGGGGDKTHTCPP- 

CTTGCCCAGCACCTGAACTCCTGGGGGGACCGTCAGTTTTCCTCTTCCCCCCAAAACCCA 

121 + ♦ ♦ ♦ ♦ ♦ 180 

GAACGGGTCGTGGACTTGAGGACCCCCCTGGCAGTCAAAAGGAGAAGGGGGGTTTTGGGT 
C PAPELLGGPSVFLFP P K P K - 

AGGACACCCTCATGATCTCCCGGACCCCTGAGGTCACATGCGTGGTGGTGGACGTGAGCC 

181 + + ♦ ♦ ♦ + 240 

TCCTGTGGGAGTACTAGAGGGCCTGGGGACTCCAGTGTACGCACCACCACCTGCACTCGG 
DTLMISRTPBVTCVVVDVSH- 

ACGAAGACCCTGAGGTCAAGTTCAACTGGTACGTGGACGGCGTGGAGGTGCATAATGCCA 

241 ♦ •■ ♦ ♦ ♦ ♦ 300 

TGCTTCTGGGACTCCAGTTCAAGTTGACCATGCACCTGCCGCACCTCCACGTATTACGGT 
EDPBVKFNWYVDGVBVHNAK- 

AGACAAAGCCGCGGGAGGAGCAGTACAACAGCACGTACCGTGTGGTCAGCGTCCTCACCG 

301 + ♦ ♦ ♦ ♦ + 360 

TCTGTTTCGGCGCCCTCCTCGTCATGTTGTCGTGCATGGCACACCAGTCGCAGGAGTGGC 
TKPREBQYNSTYRVVS VLTV- 

TCCTGCACCAGGACTGGCTGAATGGCAAGGAGTACAAGTGCAAGGTCTCCAACAAAGCCC 

361 ♦ ♦ ♦ ♦ ♦ «" 420 

AGGACGTGGTCCTGACCGACTTACCGTTCCTCATGTTCACGTTCCAGAGGTTGTTTCGGG 
LHQDWLNGKEYKCKVSNKAL- 

TCCCAGCCCCCATCGAGAAAACCATCTCCAAAGCCAAAGGGCAGCCCCGAGAACCACAGG 

421 ♦ ♦ ♦ ♦ ♦ * 480 

AGGGTC GGGGGTAGC TCTTTTGGT AG AGGTTTCGGTTTC CC GTC GGGGC TCTTGGTGTC C 
PAPIEKTISKAKGQPREPQV- 

TGTACACCCTGCCCCCATCCCGGGATGAGCTGACCAAGAACCAGGTCAGCCTGACCTCCC 

481 ♦ ♦ + ♦ ♦ ♦ 540 

ACATGTGGGACGGGGGTAGGGCCCTACTCGACTGGTTCTTGGTCCAGTCGGACTGGACGG 
YTLPPSRDELTKNQVSLTCL- 

TGGTCAAAGGCTTCTATCCCAGCGACATCGCCGTGGAGTGGGAGAGCAATGGGCAGCCGG 

541 ♦ ♦ ♦ ♦ ♦ ♦ 600 

ACCAGTTTCCGAAGATAGGGTCGCTGTAGCGGCACCTCACCCTCTCGTTACCCGTCGGCC 
V KG P Y P S D I A V E W E SNG Q P E - 

AGAACAACTACAAGACCACGCCTCCCGTGCTGGACTCCGACGGCTCCTTCTTCCTCTACA 
601 ♦ ♦ + ♦ ♦ + 660 



NNYKTTPPVLDSDG3FFLYS- 

GC AAGCTC ACCGTGGACAAGAGC AGGTGGC AGC AGGGGAACGTCTTCTC ATGCTCCGTGA 

661 ♦ ♦ ♦ ♦ ♦ + 720 

C GTTCGAGTGGCAC CTGTTCTCGTCC ACCGTCGTCCCCTTGCAGAAGAGT AC GAGGC ACT 
KLTVDKSRWQQGNVPSC""SVM- 

TGCATGAGGCTCTGCACAACCACTACACGCAGAAGAGCCTCTCCCTGTCTCCGGGTAAAT 

721 ♦ ♦ ♦ ♦ + + 780 

ACGTACTXTCGAGACGTGTTGGTGATGTGCGTCTTCTCGGAGAGGGACAGAGGCCCATTTA 
HEALHNHYTQKSL3L3 P G K * - 

BanHX 

I 

AATGGATCC 

781 789 

TTACCTAGG 
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FIG. 13 

I 

TCTAGATTTGTTTTAACTAATTAAAGGAGGAATAACATATGGACAAAACTCACACATGTC 

I .+ * ♦ ♦ + 60 

AGATCTAAACAAAATTGATTAATTTCCTCCTTATTGTATACCTGTTTTCAGTGTGTACAC 

MDKTHTC P - 
CACCTTGTCCAGCTCCGGAACTCCTGGGGGGACCGTCAGTCTTCCTCTTCCCCCCAAAAC 

61 ' ♦ ♦ ♦ ♦ ♦ * 120 

GTGGAACAGGTCGAGGCCTTGAGGACCCCCCTGGCAGTCAGAAGGAGAAGGGGGGTTTTG 
PCPAPELLGGPSVFLFPPKP* 

CCAAGGACACCCTCATGATCTCCCGGACCCCTGAGGTCACATGCGTGCTGGTGGACGTGA 
121 ♦ ♦ ♦ ♦ ♦ + 180 

GGTTCCTGTGGGAGTACTAGAGGGCCTGGGGACTCCAGTGTACGCACCACCACCTGCACT 
KDTLMISRTPEVTC VVVDVS- 

GC C ACGAAG ACC C TGAGGTC AAGTTC AACTGGT AC GTGGAC GGCGTGG AGGTGC AT AATG 

181 ♦ ♦ + + ♦ + 240 

CGGTGCTTCTGGGACTCCAGTTCAAGTTGACCATGCACCTGCCGCACCTCCACGTATTAC 
HBDPEVKPNWYVDGVBVH NA- 

CCAAGACAAAGCCGCGGGAGGAGCAGTACAACAGCACGTACCGTGTGGTCAGCGTCCTCA 

241 + + ♦ ♦ + ♦ 300 

GGTTCTGTTTCGGCGCCCTCCTCGTCATGTTGTCGTGCATGGCACACCAGTCGCAGGAGT 
KTKPREBQYNSTYRVVSVLT- 

CCGTCCTGCACCAGGACTGGCTGAATGGCAAGGAGTACAAGTGCAAGGTCTCCAACAAAG 

301 ♦ ♦ + + ♦ ♦ 360 

GGCAGGACGTGGTCCTGACCGACTTACCGTTCCTCATGTTCACGTTCCAGAGGTTGTTTC 
VLHQDWLNGKEYKCKV SNKA- 

CCCTCCCAGCCCCCATCGAGAAAACCATCTCCAAAGCCAAAGGGCAGCCCCGAGAACCAC 

361 ♦ + ♦ + ♦ ♦ 420 

GGGAGGGTCGGGGGTAGCTCTTTTGGTAGAGGTTTCGGTTTCCCGTCGGGGCTCTTGG 
L PAPI EKTI SKAKGQPRE P Q - 

AGGTGTACACCCTGCCCCCATCCCGGGATGAGCTGACCAAGAACCAGGTCAGCCTGACCT 

421 ♦ + + + + + 480 

TCCACATGTGGGACGGGGGTAGGGCCCTACTCGACTGGTTCTTGGTCCAGTCGGACTGGA 
VYTLPP3RDBLTKNQVSLTC- 

GCCTGGTCAAAGGCTTCTATCCCAGCGACATCGCCGTGGAGTGGGAGAGCAATGGGCAGC 

481 ♦ ♦ ♦ ♦ ♦ + 540 

CGGACCAGTTTCCGAAGATAGGGTCGCTGTAGCGGCACCTCACCCTCTCGTTACCCGTCG 

LVKGPY PSDIAVEWE3 NGQ P - 
CGGAGAACAACTACAAGACCACGCCTCCCGTGCTGGACTCCGACGGCTCCTTCTTCCTCT 

541 ♦ ♦ + ♦ + ♦ 600 

GCCTCTTGTTGATGTTCTGGTGCGGAGGGCACGACCTGAGGCTGCCGAGGAAGAAGGAGA 
ENNYKTTPPVLDSDGS F F L Y - 

ACAGCAAGCTCACCGTGGACAAGAGCAGGTGGCAGCAGGGGAACGTCTTCTCATGCTCCG 

601 ♦ ♦ ♦ ♦ ♦ ♦ 660 

TGTCGTTCGAGTGGCACCTGTTCTCGTCCACCGTCGTCCCCTTGCAGAAGAGTACGAGGC 
SKLTVDKSRWQQGNVPSCSV- 

TGATGCATGAGGCTCTGCACAACCACTACACGCAGAAGAGCCTCTCCCTGTCTCCGGGTA 

661 + ♦ + ♦ * + 720 

ACT ACGT ACTCCGAG AC GTGTTGGTGATGTGC GTCTTCTCGGAGAGGGAC AGAGGCCC AT 
MHEALHNHYTQKSLSLS PGR* 

AAGGTGGAGGTGGTGGTGGAGGTACTTACTCTTGCCACTTCGGCCCGCTGACTTGGGTTT 

721 ♦ ♦ * ♦ ♦ ♦ 780 

TTCCACCTCCACCACCACCTCCATGAATGAGAACGGTGAAGCCGGGCGACTGAACCCAAA 
GGGGGGGTYSCH PGPLTWVC- 

BamHI 
I 

GCAAACCGCAGGGTGGTTAATCTCGTGGATCC 

781 + ♦ + • • 812 

CGTTTGGCGTCCCACCAATTAGAGCACCTAGG . 
K P Q G G * 
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FIG. 14 



TCTAGATTTGTTTTAACTAATTAAAGGAGGAATAACATATGGGAGGTACTTACTCTTGCC 

1 ♦ ♦ ♦ ♦ ♦ ♦ 60 

AGATCTAAACAAAATTGATTAATTTCCTCCTTATTGTATACCCTCCATGAATGAGAACGG 

MGGTYSCH- 

ACTTCGGCCCGCTGACTTGGGTATGTAAGCCACAAGGGGGTGGGGGAGGCGGGGGGGACA 

61 ♦ ♦ + ♦ ♦ ♦ 120 

TGAAGCCGGGCGACTGAACCCATACATTCGGTGTTCCCCCACCCCCTCCGCCCCCCCTGT 
PG PLTWVC K PQGGGGGGGDK- 

AAACTCACACATGTCCACCTTGCCCAGCACCTGAACTCCTGGGGGGACCGTCAGTTTTCC 

121 --■ ♦ ♦ ♦ > ♦ + 180 

TTTGAGTGTGTACAGGTGGAACGGGTCGTGGACTTGAGGACCCCCCTGGCAGTCAAAAGG 
THTC PP C PAPELLGGP3VPL- 

TCTTCCCCCCAAAACCCAAGGACACCCTCATGATCTCCCGGACCCCTGAGGTCACATGCG 

181 ♦ ♦ ♦ ♦ ♦ + 240 

AG AAGGGGGGTTTTGGGTTC CTGTGGGAGTACTAGAGGGCCTGGGGACTCC AGTGTACGC 
FPPKPKDTLMISRTPEVTCV- 

TGGTGGTGGACGTGAGCCACGAAGACCCTGAGGTCAAGTTCAACTGGTACGTGGACGGCG 

241 ♦ ♦ ♦ ♦ ♦ ♦ 300 

ACCACCACCTGCACTCGGTGCTTCTGGGACTCCAGTTCAAGTTGACCATGCACCTGCCGC 
VVDVSHEDPBVKFNWYVDGV- 

TGGAGGTGCATAATGCCAAGACAAAGCCGCGGGAGGACCAGTACAACAGCACGTACCGTG 

301 ♦ ♦ ♦ ♦ ♦ ♦ 360 

ACCTCCACGTATTACGGTTCTGTTTCGGCGCCCTCCTCGTCATGTTGTCGTGCATGGCAC 
EVHNAKTKPREBQYNSTYRV- 

TGGTCAGCGTCCTCACCGTCCTGCACCAGGACTGGCTGAATGGCAAGGAGTACAAGTGCA 

361 ♦ + ♦ ♦ ♦ ♦ 420 

ACCAGTCGCAGGAGTGGCAGGACGTGGTCCTGACCGACTTACCGTTCCTCATGTTCACGT 
VSVLTVLHQDWLNGKEY KC K - 

AGGTCTCCAACAAAGCCCTCCCAGCCCCCATCGAGAAAACCATCTCCAAAGCCAAAGGGC 

421 ♦ ♦ ♦ ♦ ♦ ♦ 480 

TCCAGAGGTTGTTTCGGGAGGGTCGGGGGTAGCTCTTTTGGTAGAGGTTTCGGTTTCCCG 
VSNKALPAPIEKTI 3KAKGQ* 

AGCCCCGAGAACCACAGGTGTACACCCTGCCCCCATCCCGGGATGAGCTGACCAAGAACC 

481 ♦ ♦ ♦ ♦ ♦ ♦ 540 

TCGGGGCTCTTGGTGTCCACATGTGGGACGGGGGTAGGGCCCTACTCGACT^ 

PRE PQVYTL P PS RDELTRNQ- 

AGGTCAKCTCACCTGCCTGGTCAAAGGCTTCTATCCCAGCGACATCGC 

541 ♦ + ♦ ♦ ♦ + 600 

TCCAGTCGGACTGCACGGACCAGTTTCCGAAGATAGGGTCGCTGTAGCGGCACCTCACCC 
V3LTCLVKGPYPSDIAVEWE- 

AGAGCAATGGGCAGCCGGAGAACAACTACAAGACCACGCCTCCCGTGCTGGACTCCGACG 

601 ♦ ♦ ♦ + ♦ + 660 

TCTCGTTACCCGTCGGCCTCTTGTTGATGTTCTGGTGCGGAGGGCACGACCTGAGGCTGC 
SNGQPENNYKTTPPVLDSDG- 

GCTCCTTCTTCCTCTACAGCAAGCTC AC CGTGGACAAGAGCAGGTGGC AGCAGGGGAACG 

661 ♦ ♦ ----- ♦ - - +- 720 

CGAGGAAGAAGGAGATGTCGTTCGAGTGGCACCTGTTCTCGTCCACCGTCGTCCCCTTGC 
3 P P LY 3 K L TV D K 3 RWQffG-MV- 

TCTTCTCATGCTCCGTGATGCATGAGGCTCTGCACAACCACTACACGCAGAAGAGCCTCT 

721 ♦ ♦ * ♦ ♦ + 78° 

AGAAGAGTACGAGGCACTACGTACTCCGAGACGTGTTGGTGATGTGCGTCTTCTCGGAGA 
FSC3VMHEALHNHYTQK3L3- 

BaxnHI 
I 

CCCTGTCTCCGGGTAAATAATGGATCC 

781 ♦ * 807 

GGGACAGAGGCCCATTTATTACCTAGG 
L 3 P G K • 
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FIG. 15 

I 

TCTAGATTTGACTTTTAACTTTTAGAAGGAGCAATAAAATATGGGACGTACTTACTCTTG 

1 ♦ ♦ ♦ ♦ ♦ ♦ 60 

AGATCTAAACTCAAAATTGAAAATCTTCCTCCTTATTTTATACCCTCCATGAATGAGAAC 
b MGGTYSC- 

CCACTTCGGCCCACTGACTTGGGTTTGCAAACCGCAGGGTGGCGGCGGCGGCGGCGGTGG 

61 + ♦ ♦ ♦ * ♦ 120 

GGTGAAGCCGGGTGACTGAACCCAAACGTTTCGCGTCCCACCGCCGCCGCCGCCGCCACC 
b H FG P LTWVC K PQGGGGGG'GG 

TACCTATTCCTGTCATTTTGGCCCOCTGACCTGGGTATGTAAGCCACAAGGGGGTGGGGG 

121 ♦ «■ ♦ ♦ ♦ ♦ 180 

ATGGATAAGGACAGTAAAACCGGGCGACTGGACCCATACATTCGGTGTTCCCCCACCCCC 
b TY3CHFGPLTWVCKPQGGGG - 

AGGCGGGGGGGACAAAACTCACACATGTCCACCTTGCCCAGCACCTGAACTCCTGGGGGG 

181 ♦ ♦ ♦ ♦ ♦ ♦ 240 

TCCGCCCCCCCTGTTTTGAGTGTGTACAGGTGGAACGGGTCGTGGACTTGAGGACCCCCC 
b GGGDKTHTCPPCPAPELLGG 

ACCGTCAGTTTTCCTCTTCCCCCCAAAACCCAAGGACACCCTCATGATCTCCCGGACCCC 

241 ♦ ♦ ♦ ♦ ♦ 300 

TGGCAGTCAAAAGGAGAAGGGGGGTTTTGGGTTCCTGTGGGAGTACTAGAGGGCCTGGGG 
b P3VFLFPPKPKDTLMI3 RTP 

TGAGGTCACATX^GTGGTGGTGGACGTC^GCCACGAAGACCCTGAGGTCAAGTTCAACTG 

301 ♦ + ♦ ♦ ♦ + 360 

ACTCCAGTGTACC^ACCACCACCTGCACTCGGTGCTTCTGGGACTCCAGTTCAACTTGAC 
b EVTCVVVDVSHBDPBVKFNW- 

GTACGTGGACGGCGTGGAGGTGCATAATGCCAAGACAAAGCCGCGGGAGGAGCAGTACAA 

361 ♦ + ♦ ♦ ♦ + 420 

CATGCACCTGCCGCACCTCCACGTATTACGGTTCTGTTTCGGCGCCCTCCTCGTCATGTT 
b YVDGVEVHNAKTKPRBEQY N 

CAGCACGTACCGTGTGGTCAGCGTCCTCACCGTCCTGCACCAGGACTGGCTGAATGGCAA 

421 ♦ ♦ ♦ ♦ + ♦ 480 

GTCGTGCATGGCACACCAGTCGCAGGAGTGGCAGGACGTGGTCCTGACCGACTTACCGTT 
b 3TYRVVSVLTVLHQDWLNGK 

GGAGTACAAGTGCAAGGTCTCCAACAAAGCCCTCCCAGCCCCCATCGAGAAAACCATCTC 

481 ♦ ♦ + ♦ ♦ * 540 

CCTCATGTTCACGTTCCAGAGGTTGTTTCGGGAGGGTCGGGGGTAGCTCTTTTGGTAGAG 
b BYKCKVSMKALPAPIBKT IS 

CAAAGCCAAAGGGCAGCCCCGAGAACCACAGGTGTACACCCTGCCCCCATCCCGGGATGA 

541 ♦ ♦ ♦ ♦ ♦ ♦ 600 

GTTTCGGTTTCCCGTCGGGGCTCTTGGTGTCCACATGTGGGACGGGGGTAGGGCCCTACT 
b KAKGQPREPQVYTLPPSRDE 

GCTGACCAAGAACCAGGTCAGCCTGACCTGCCTGGTCAAAGGCTTCTATCCCAGCGACAT 

601 ♦ + ♦ ♦ ♦ ♦ 660 

CGACTGGTTCTTGGTCCAGTCGGACTGGACGGACC^GTTTCCGAAGATAGGGTCGCTGTA 
b LTKNQVSLTCLVKGFYP3D I 

CGCCGTGGAGTGGGAGAGCAATGGGCAGCCGGAGAACAACTACAAGACCACGCCTCCCGT 

661 ♦ ♦ ♦ ♦ + ♦ 720 

GCGGCACCTCACCCTCTCGTTACCCGTCGGCCTCTTGTTGATGTTCTGGTGCGGAGGGCA 
b AVBWE3NGQPENNYKTTPPV 

GCTGGACTCCGACGGCTCCTTCTTCCTCTACAGCAAGCTCACCGTGGACAAGAGCAGGTG 

721 ♦ ♦ + ♦ ♦ ♦ 780 

CGACCTGAGGCTGCCGAGGAAGAAGGAGATGTCGTTCGAGTGGCACCTGTTCTCGTCCAC 
b LDSDGSFFLYSKLTVDKSRW 

GCAGCAGGGGAACGTCTTCTCATGCTCCGTGATGCATGAGGCTCTGCACAACCACTACAC 

781 + + ♦ ♦ ♦ + 840 

CGTCGTCCCCTTGCAGAAGAGTACGAGGCACTACGTACTCCGAGACGTGTTGGTGATGTG 
b QQGNVPSCSVMHEALHNHYT 

BamHX 
I 

GCAGAAGAGCCTCTCCCTGTCTCCGGGTAAATAATGGATCC 

841 + ♦ + ; 881 

CGTCTTCTCGGAGAGGGACAGAGGCCCATTTATTACCTAGG 
b QKSLSL3PGK* 
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Xbal 

I 

TCTAGATTTtnTTTAACTAATTAAAGGAGGAATAACATATGGACAAAACTCACACATCTC 

1 ♦ * ♦ + + ♦ 60 

AGATCTAAACAAAATTGATTAATTTCCTCCTTATTCTATACCTCTTTTGA 
C MOKTHTCP- 

CACCTTGCCCAGCACCTGAACTCCTGGGGGGACCGTCAGTTTTCCTCTTC 

61 * ♦ ♦ ♦ ♦ 120 

GTGGAACGGGTCGTGGACTTGAGGACCCCCCTGGCAGTCAAAAGGAGAAGGGCGGTTTTG 
C PC PAPBLLGGPSVFLPPPKP- 

CC AAGGAC ACCCTC ATGATCTCC CGCAC CCCTGAGGTC AC ATGCGTGGTGCTGGAC GTG A 

121 ♦ ♦ ♦ ♦ 180 

GGTTCCTGTGGGAGTACTAGAGGGCCTGCGGACTCCAGTGTACGCACC\CCACCTGCACT 
C KDT LMI 3 RT PBVTCVVVDV S- 

GCCACGAAGACCCTGAGGTCAAGTTCAACTGGTACGTGGACGCCGTGGAGGTGCATAATG 
181 ♦ + ♦ ♦ + ♦ 240 

CGGTGCTTCTGGGACTCOVGTTCAAGTTGACCATGCACCTGCCCCACCTCCACGTATTAC 
C H ED P B V KFNWYVDGVEVHNA- 

CCAAGACAAAGCCCCGGGAGGAGCACTACAACAGCACGTACCGTGTGGTCAGCGTCCTCA 

241 ♦ ♦ ♦ ♦ ♦ ♦ 300 

GGTTCTCTTTCGGC GC CC TCCTC GTC ATGTTGTCGTGCATGGC ACACC AGTC GC AGGAGT 
c KTKPRBEQYNSTYRVVSVLT- 

CCGTCCTGCACCAGGACTCGCTGAATGGCAAGGAGTACAAGTGCAAGGTCTCCAACAAAG 

301 ♦ + -+=-- ♦ + + 360 

GGCAGGACGTGGTCCTGACCGACTTACCGTTCCTCATGTTCACGTTCCAGAGGTTCT 
C VLHQDWLMGKBYKCKV SNKA- 

CCCTCCCAGCCCCCATCGAGAAAACCATCTCCAAAGCCAAAGGGCACCCCCGAGAACCAC 

361 + ♦ i ♦ ♦ ♦ 420 

G<X^GGGTCGGGGGTAGCTCTTTTGGTAGAGGTTTCGGTTTCCCGTCGGGG 
c LPAPIEKTISKAKGQPREPQ- 

AGGTGTACACCCTGCCTCCATCCCGGGATGAGCTGACCAAGAACCAGGTCAGCCTGACCT 
421 ♦ + * + + + 480 

TCCACATGTGGGACGGAGGTAGGCCCCTACTCGACTGCTTCTTGGTCCAGTCGGACTGGA 
c VYTLPPSRDELTKNQV3LTC- 

GCCTGCTCAAACGCTTCTATCCCAGCGACATCGCCGTGGAGTGGGAGAGCAATGGGCAGC 
481 + ♦ ♦ ♦ + + 540 

CGGACCAGTTTCCGAAGATAGGGTCGCTGTAGCGGCACCTCACCCTCTCGTTACCCGTCG 
C LVKGPY PSD IAVEWESNGQ P* 

CGGAGAACAACTACAAGACCACGCCTCCCGTGCTGGACTCCGACGGCTCCTTCTTCCTCT 

541 ♦ ♦ * + + + 600 

GC CTCTTGTTG ATGTTCTGGTGCGG AGGGC ACG AC CTG AGGCTGCC G AGG AAGAAGG AG A 
C EMNYKTTPPVLD3DGSPFLY* 

ACAGCAAGCTCACCGTGGACAAGAGCACGTGGCAGCAGGGGAACGTCTTCTCATGCTCCG 
601 ♦ + ♦ + ♦ + 660 

TGTCGTTCGAGTGGCACCTGTTCTCGTCCACCGTCGTCCCCTTGCAGAAGAGTACGAGGC 
C 3 KtfTVDKSRWQQGNVFSC S V - 

TGATGCATGAGOCTCTGCACAACCACTACACGCAGAAGAGCCTCTCCCTGTCTCCGGGTA 

661 ♦ ♦ + + ♦ + 720 

ACTACGTACTCCGAGACGTGTTGGTGATGTGCGTCTTCTCGGAGAGGGACAGAGGCCCAT 
C MHEALHNHYTQKSLSLS PGR* 

AAGGTGGAGGTGGTGGCGGAGGTACTTACTCTTGCCACTTCGGCCCACTGACTTGGGTTT 

721 + ♦ + + ♦ ♦ 780 

TTCCACCTCCACCACCGCCTCCATGAATGAGAACGGTGAAGCCGGGTGACTGAACCCAAA 
c GGGGGGG'TYSCHFGPLT~WVC- 

GCAAACCGCAGGGTGGCGGCGGCGGCGGCGGTGGTACCTATTCCTGTCATTTTGGCCCGC 

781 «• ♦ + ♦ «■ + 840 

CGTTTGGCGTCCCACCGCCGCCGCCGCCGCCACCATGGATAAGGACAGTAAAACCGGGCG 
c KPQGGGGGGGGTYSCHFGPL- 

BamHI 
I 

TG AC CTGGGT ATGT AAGCC AC AAGGGGGTT AATCTCG AGG ATCC 

841 ♦ ♦ ♦ * 884 

ACTGGACCCATACATTCGGTGTTCCCCCAATTAGAGCTCCTAGG 
C TWVC RPQGG * 
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FIG. 17 A 

(Afl£II sticky end) 5' GCGTAACGTATGCATGGTCTCC • 

(position #4358 in pAMG21) 3 1 TGCACGCATTGCATACGTACCAGAGG - 

- CCATGCGAGAGTAGGGAACTGCC AGGCATCAAATAAAACGAAAGGCTCAGTCGAAAGACT - 

- GGTACGCTCTCATCCCTTGACGGTCCGTAGTTTATTTTGCTTTCCGAGTCAGCTTTCTGA - 

• GGGCCTTTCGTTTTATCTGTTGTTTGTCGGTGAACGCTCTCCTGAGTAGGACAAATCCGC - 

• CCCGGAAAGCAAAATAGACAACAAACAGCCACTTGCGAGAGGACTC ATCCTGTTTAGGCG - 

- C GGG AGC GG ATTTGAAC GTTGC GAAGC AACGGC C C GGAGGGTGGC GGGC AGGAC GC C C GC • 

- GCCCTCGCCTAAACTTGCAACGCTTCGTTGCCGGGCCTCCCACCGCCCGTCCTGCGGGCG - 

• CATAAACTGCCAGGCATCAAATTAAGCAGAAGGCCATCCTGACGGATGGCCTTTTTGCGT - 

- GTATTTGACGGTCCGTAGTTTAATTCGTCTTCCGGTAGGACTGCCTACCGGAAAAACGCA - 

Aatu 

• TTC T AC AAAC TC TTTTGTTT ATTTTTC TAAATAC ATTC AAATATGGACGTCGT AC TTAAC - 

- AAGATGTTTGAGAAAACAAATAAAAAGATTTATGTAAGTTTATACCTGCAGCATGAATTG - 

• TTTTAAAGT ATGGGC AATC AATTGCTC CTGTT AAAATTGCTTTAGAAATAC TTTGGC AGC • 

- AAAATTTCATACCCGTTAGTTAACGAGGACAATTTTAACGAAATCTTTATGAAACCGTCG - 

- GGTTTGTTGT ATTGAGTTTCATTTGC GC ATTGGTT AAATGGAAAGTGACC GTGC GC TT AC • 

• C C AAAC AAC AT AAC TC AAAGT AAAC GC GT AACC AATTTAC C TTTC AC TGGC AC GC GAATG - 

- TACAGCCTAATATTTTTGAAATATCCCAAGAGCTTTTTC - 

- ATGTCGGATTATAAAAACTTTATAGGGTTCTCGAAAAAGGAAGCGTACGGGTGCGATTTG - 

- ATTCTTTTTCTCTTTTGGTTAAATCGTTGTTTGATTTATTA - 

- TAAGAAAAAGAGAAAACCAATTTAGCAACAAACTAAATAATAAACGATATAAATAAAAAG - 

- GATAATTATCAACTAGAGAAGGAACAATTAATGGTATGTTCATACACGCATGTAAAAATA - 

- CTATTAATAGTTGATCTCTTCCTTGTTAATTACCATACAAGTATGTGCGTACATTTTTAT - 

- AACTATCTATATAGTTGTCTTTCTCTGAATGTGCAAAACTAAGCATTCCGAAGCCATTAT - 

- TTGAT AGAT AT ATCAAC AG AAAGAGACTTAC AC GTTTTGATTC GTAAGGCTTC GGTAAT A - 

- TAGC AGTATGAAT AGGG AAAC TAAACCC AGTGATAAGACC TGATGATTTC GC TTCTTT AA - 

- ATC GTC AT ACTTATCCC TTTGATTTGGGTC AC TATTCTGGACTACTAAAGC GAAGAAATT - 

- TTAC ATTTGGAGATTTTTTATTTACAGCATTGTTTTC AAATATATTCCAATTAATCGGTG - 

• AATGT AAAC C TC TAAAAAAT AAATGTC GT AAC AAAAGTTT ATAT AAGGTTAATT AGCC AC • 

• AATGATTGGAGTTAGAATAATCTACTATAGGATCATATTTTATTAAATTAGCGTCATCAT - 
. TTACTAACCTCAATCTTATTAGATGATATCCTAGTATAAAATAATTTAATCGCAGTAGTA - 

- AATATTGCCTCCATTTTTTAGGGTAATTATCCAGAATTGAAATATC AGATTTAACCATAG - 

- TTATAACGGAGGTAAAAAATCCCATTAATAGGTCTTAACTTTATAGTCTAAATTGGTATC - 

• AATGAGGATAAATGATCGCGAGTAAATAATATTCACAATGTACCATTTTAGTCATATCAG - 

• TTACTCCTATTTACTAGCGCTCATTTA1TATAAGTGTTACATGGTAAAATCAGTATAGTC - 

- AT AAGC ATTGATT AAT ATC ATTATTGC TTC TACAGGC TTTAATTTT ATT AATT ATTC TGT - 

• TATTCGTAACTAATTATAGTAATAACGAAGATGTCCGAAATTAAAATAATTAATAAGACA - 

- AAGTGTCGTCGGCATTTATGTCTTTCATACCCATCTCTTTATCCTTACCTATTGTTTGTC - 

• TTC ACAGCAGCCGTAAATACAGAAAGTATGGGTAGAGAAATAGGAATGGATAACAAACAG - 

■ GC AAGTTTTGCGTGTTATATATCATTAAAACGGTAATAGATTGAC ATTTGATTCTAATAA - 

- CGTTCAAAACGCACAAT ATAT AGTAATTTTGCCATTATCTAACTGTAAACTAAGATT ATT - 
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- ATTGGATTTTTGTCACACTATTATATCGCTTGAAATACAATTGTTTAACATAAGTACCTG - 

- TAACCTAAAAACAGTGTGATAATATAGCGAACTTTATGTTAACAAATTGTATTCATGGAC - 

- TAGGATCGTACAGGTTTACGCAAGAAAATGGTTTGTTATAGTCGATTAATCGATTTGATT - 

• ATCCTAGCATGTCCAAATGCGTTCTTTTACCAAACAATATCAGCTAATTAGCTAAACTAA - 

- CTAGATTTGTTTTAACTAATTAAAGGAGGAATAACATATGGTTAACGCGTTGGAATTCGA - 

- GATCTAAACAAAATTGATTAATTTCCTCCTTATTGTATACCAATTGCGCAACCTTAAGCT - 

• GCTCACTAGTGTCGACCTGCAGGGTACCATGGAAGCTTACTCGAGGATCCGCGGAAAGAA - 

• CGAGTGATCACAGCTGGACGTCCCATGGTACCTTCGAATGAGCTCCTAGGCGCCTTTCTT - 

- GAAGAAGAAGAAGAAAGCCCGAAAGGAAGCTGAGTTGGCTGCTGCCACCGCTGAGCAATA - 

- CTTCTTCTTCTTCTTTCGGGCTTTCCTTCGACTCAACCGACGACGGTGGCGACTCGTTAT - 

- ACTAGCATAACCCCTTGGGGCCTCTAAACGGGTCTTGAGGGGTTTTTTGCTGAAAGGAGG - 

- TGATCGTATTGGGGAACCCCGGAGATTTGCCCAGAACTCCCCAAAAAACGACTTTCCTCC - 

- AACCGCTCTTCACGCTCTTCACGC 3' (SAfiH sticky end] 

-TTGGCGAGAAGTGCGAGAAGTG 5' (position #5904 in pAMG21) 
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FIG. 19A 



Ndel 

CATATGGACAAAACTCACACATGTCCACCTTGTCCAGCTCCGGAACTCCTGGGGGGACCG 

^ „„.------+•*•-- _ + ._-..----.- + -- •- -- -- - + -- -*-*--- + -- -- -"-"*+ 60 

GTATACCTGTTTTGAGTGTGTACAGGTGGAACAGGTCGAGGCCTTGAGGACCCCCCTGGC 

MDKTHTCPPCPAPBLL GGP 

TCAGTCTTCCTCTTCCCCCCAAAACCCAAGGACACCCTCATGATCTCCCGGACCCCTGAG 

gi - - + + ♦ + * + ^20 

AGTCAGAAGGAGAAGGGGGGTTTTGGGTTCCTGTGGGAGTACTAGAGGGCCTGGGGACTC 

SVFLFPPKPKDTLMISRTPE 

GTCACATGCGTGGTGGTGGACGTGAGCCACGAAGACCCTGAGGTCAAGTTCAACTGGTAC 

121 * + + + + + 180 

CAGTGTACGCACCACCACCTGCACTCGGTGCTTCTGGGACTCCAGTTCAAGTTGACCATG 

VTCVVVDVSHEDPEVKFNWY 

GTGGACGGCGTGGAGGTGCATAATGCCAAGACAAAGCCGCGGGAGGAGCAGTACAACAGC 

181 + + ♦ ♦ + + 240 

CACCTGCCGCACCTCCACGTATTACGGTTCTGTTTCGGCGCCCTCCTCGTCATGTTGTCG 

VDGVEVHNAKTKPREEQYNS 

ACGTACCGTGTGGTCAGCGTCCTCACCGTCCTGCACCAGGACTGGCTGAATGGCAAGGAG 

241 + + + + + + 300 

TGCATGGCACACCAGTCGCAGGAGTGGCAGGACGTGGTCCTGACCGACTTACCGTTCCTC 

TYRVVSVLTVLHQDWLNGKE 

TACAAGTGCAAGGTCTCCAACAAAGCCCTCCCAGCCCCCATCGAGAAAACCATCTCCAAA 

_ ni _ + + 4..--- + + + 360 

ATGTTCACGTTCCAGAGGTTGTTTCGGGAGGGTCGGGGGTAGCTCTTTTGGTAGAGGTTT 

YKCKVSNKALPAPIEKTISK 

GCCAAAGGGCAGCCCCGAGAACCACAGGTGTACACCCTGCCCCCATCCCGGGATGAGCTG 

-.gi ♦ + + + + + iZ0 

CGGTTTCCCGTCGGGGCTCTTGGTGTCCACATGTGGGACGGGGGTAGGGCCCTACTCGAC 

AKGQPREPQVYTLPP3RDEL 

ACCAAGAACCAGGTCAGCCTGACCTGCCTGGTCAAAGGCTTCTATCCCAGCGACATCGCC ^ 

4 2 1 TGGTTCTTGGTCCAGTCGGACTGGACGGACCAGTTTCCGAAGATAGGGTCGCTGTAGCGG 
TKNQVSLTCLVKGPYPSDIA 
GTGGAGTGGGAGAGCAATGGGCAGCCGGAGAACAACTACAAGACCACGCCTCCCGTGCTG ^ 

4 8 1 CACCTCACCCTCTCGTTACCCGTCGGCCTCTTGTTGATGTTCTGGTGCGGAGGGCACGAC 
VEWE s NG Q PENNY KTTP PVL - 
GACTCCGACGGCTCCTTCTTCCTCTACAGCAAGCTCACCGTGGACAAGAGCAGGTGGCAG 

54 1 CTGAGGCTGCCGAGGAAGAAGGAGATGTCGTTCGAGTGGCACCTGTTCTCGTCCACCGTC 
DSDGSFFLYSKLTVDKSRWQ.- 
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FIG. 19B 

CAGGGGAACGTCTTCTCATGCTCCGTGATGCATGAGGCTCTGCACAACCACTACACGCAG 

601 ♦ + * + ----♦..-.-.-..+ 660 

GTCCCCTTGCAGAAGAGTACGAGGCACTACGTACTCCGAGACGTGTTGGTGATGTGCGTC 

QGNVFSCSVMHEALHNHYTQ 

AAGAGCCTCTCCCTGTCTCCGGGTAAAGGTGGAGGTGGTGGTGACTTCCTGCCGCACTAC 

661 + + + + + + 720 

TTCTCGGAGAGGGACAGAGGCCCATTTCCACCTCCACCACCACTGAAGGACGGCGTGATG 

KSLSLSPGKGGGGGDFLPHY 

BamHI 
I 

AAAAACACCTCTCTGGGTCACCGTCCGTAATGGATCC 

721 * 757 

TTTTTGTGGAGAGACCCAGTGGCAGGCATTACCTAGG 

KNTSLGHRP* 
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FIG. 20A 

Ndel 

CATATGGACTTCCTGCCGCACTACAAAAACACCTCTCTGGGTCACCGTCCGGGTGGAGGC 
GTATACCTGAAGGACGGCGTGATGTTTTTGTGGAGAGACCCAGTGGCAGGCCCACCTCCG 

MDFLPHYKNTSLGHRPGGG - 
GGTGGGGACAAAACTCACACATGTCCACCTTGCCCAGCACCTGAACTCCTGGGGGGACCG ^ 
6 1 CCACCCCTGTTTTGAGTGTGTACAGGTGGAACGGGTCGTGG ACTTGAGGACCCCCCTGGC 
GGDKTHTC PPCPAPELLGGP - 
TCAGTTTTCCTCTTCCCCCCAAAACCCAAGGACACCCTCATGATCTCCCGGACCCCTGAG ^ 
1 2 1 AGTC AAAAGGAGAAGGGGGGT^GGGTTCCTGTGGGAGTACTAGAGCMCCTGGGGACTC 
SVFLFPP KPKDTLMISRTPE - 
GTCACATGCGTGGTGGTGGACGTGAGCCACGAAGACCCTGAGGTCAAGTTCAACTGGTA^ ^ ^ ^ 
1 8 1 CAGTGTACGCACCACCACCTGCACTCGGTGCTTCTGGGACTCCAGTTCAAGTTGACCATG 
VTCVVVDVSHEDPEVKFNWY ■ 

GTGGACGGCGTGGACKTrGCATAATGCCAAGACAAAGCCGC ^ 
241 CACCTGCCGCACCTCCACGTATTACGGTTCTGTTTCGGCGCCCTCCTCGTCATGTTGTCG 
VDGVEVHNAKTKPREEQYNS - 
ACGTACCGTGTGGTCAGCGTCCTCACCGTCCTGCACCAGGACTGGCTGAA ^ 

301 tgcatggcaJaccagtcgcaggagtggcaggacgtggtc^tgaccgacttaccgttcctc 

TYRVVSVLTVLHQDWLNGKE - 

TACAAGTGCAAGGTCTCCAACAAAGCCCTCCCAGCCCCCATCGAGAAAACCAT^ ^ 
3 " ATG TTCACGiTCCAGAGGTiGTTTCGGGAGGGTCGGGGGTAGCTCTTTTGGTAGAGGTTT 

YKCKV3NKALPAPIEKTISK - 

GCCAAAGGGCAGCCCCGAGAACCACAGGTGTACACCCTGCCCCCA ^ 
4 2 1 CGGTTTCCCGTCGGGGCTCTTGGTGTCCACATGTGGGACGGGGGTAGGGCCCTACTCGAC 
AKGQPREPQVYTLPPSRDEL 
ACCAAGAACCAGGTCAGCCTGACCTGCCTGGTCAAAGGCTTCTATCCCAG^ ^ 
481 tggTTCTTGGTCCAGTCGGACTGGACGGACCAGTTTCCGAAGATAGGGTCGCTGTAGCGG 
TKNQVSLTCLVK GFYPSDIA - 
GTGGAGTGGGAGAGCAATGGGCAGCCGGAGAACAACTACAAGACCACGCCTCCCGTGCTG ^ 
541 CACCTCACCCTCTCGTTACCCGTCGGCCTCTTGTTGATGTTCTGGTGCGGAGGGCACGAC 



VEWESNGQPENNYKTTPPVL 
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FIG. 20B 

GACTCCGACGGCTCCTTCTTCCTCTACAGCAAGCTCACCGTGGACAAGAGCAGGTGGCAG 

601 + ♦ + + + + 660 

CTGAGGCTGCCGAGGAAGAAGGAGATGTCGTTCGAGTGGCACCTGTTCTCGTCCACCGTC 

D S DGS F FLYS KLTVDK S RWQ 

CAGGGGAACGTCTTCTCATGCTCCGTGATGCATGAGGCTCTGCACAACCACTACACGCAG 

661 + + + + + + 720 

GTCCCCTTGCAGAAGAGTACGAGGCACTACGTACTCCGAGACGTGTTGGTGATGTGCGTC 

QGNVFSCSVMHEALHNHYTQ 



BamHI 
I 

AAGAGCCTCTCCCTGTCTCCGGGTAAATAATGGATCCGCGG 

721 + + + + - 761 

TTCTCGGAGAGGGACAGAGGCCCATTTATTACCTAGGCGCC 

KSLSLSPGK* 
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FIG. 21A 



Ndel 

CATATGGACAAAACTCACACATGTCCACCTTGTCCAGCTCCGGAACTCCTGGGGGGACCG 

l + + + + + • + 60 

GTATACCTGTTTTGAGTGTGTACAGGTGGAACAGGTCGAGGCCTTGAGGACCCCCCTGGC 



MDKTHTCPPCPAPELLGGP 

TCAGTCTTCCTCTTCCCCCCAAAACCCAAGGACACCCTCATGATCTCCCGGACCCCTGAG 

61 + + + + + + 120 

AGTCAGAAGGAGAAGGGGGGTTTTGGGTTCCTGTGGGAGTACTAGAGGGCCTGGGGACTC 

SVFLFPPKPKDTLMISRTPE 

GTCACATGCGTGGTGGTGGACGTGAGCCACGAAGACCCTGAGGTCAAGTTCAACTGGTAC 

121 ♦ + ♦ 180 

CAGTGTACGCACCACCACCTGCACTCGGTGCTTCTGGGACTCCAGTTCAAGTTGACCATG 

VTCVVVDVSHEDPEVKFNWY 

GTGGACGGCGTGGAGGTGCATAATGCCAAGACAAAGCCGCGGGAGGAGCAGTACAACAGC 

181 + + + +-----•--- + + 240 

CACCTGCCGCACCTCCACGTATTACGGTTCTGTTTCGGCGCCCTCCTCGTCATGTTGTCG 

VDGVEVHNAKTKPREEQYNS 

ACGTACCGTGTGGTCZAGCGTCCTCACCGTCCTGCACCAGGACTGGCTGAATGGCAAGGAG ^ 

241 TGCATGGCACACCAGTCGCAGGAGTGGCAGGACGTGGTCCTGACCGACTTACCGTTCCTC 

TYRVVSVLTVIiHQDWLNGKE 

TACAAGTGCAAGGTCTCCAACAAAGCCCTCCCAGCCCCCATCGAGAAAACCATCTCCAAA 

301 + + + * + + 360 

ATGTTCACGTTCCAGAGGTTGTTTCGGGAGGGTCGGGGGTAGCTCTTTTGGTAGAGGTTT 

YKCKVSNKALPAPIEKTISK 

GCCAAAGGGCAGCCCCGAGAACCACAGGTGTACACCCTGCCCCCATCCCGGGATGAGCTG 

361 + + + + ♦ + 420 

CGGTTTCCCGTCGGGGCTCTTGGTGTCCACATGTGGGACGGGGGTAGGGCCCTACTCGAC 

AKGQPREPQV. YTLPPSRDEL 

ACCAAGAACCAGGTCAGCCTGACCTGCCTGGTCAAAGGCTTCTATCCCAGCGA^ ^ ^ ^ 

421 TGGTTCTTGGTCCAGTCGGACTGGACGGACCAGTTTCCGAAGATAGGGTCGCTGTAGCGG 



T K 



NQVSLTCLVKGFYPS DIA 



GTGGAGTGGGAGAGCAATGGGCAGCCGGAGAACAACTACAAGACCACGCCTCCCGTGCTG 

481 + + -• + + + .540 

CACCTCACCCTCTCGTTACCCGTCGGCCTCTTGTTGATGTTCTGGTGCGGAGGGCACGAC 

VEWESNGQPENNYKTTPPVL - 

GACTCCGACGGCTCCTTCTTCCTCTACAGC AAGCTCACCGTGGACAAGAGCAGGTGGCAG 

541 CTGAGGCTGCCGAGGAAGAAGGAGATGTCGTTCGAGTGGCACCTGTTCTCGTCCACCGTC 

DSDGSFPLYSKLTVDKSRWQ 
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FIG. 21B 



CAGGGGAACGTCTTCTCATGCTCCGTGATGCATGAGGCTCTGCACAACCACTACACGCAG 

601 + + + + + + 660 

GTCCCCTTGCAGAAGAGTACGAGGCACTACGTACTCCGAGACGTGTTGGTGATGTGCGTC 

QGNVFSCSVMHEALHNHYTQ " 

AAGAGCCTCTCCCTGTCTCCGGGTAAAGGTGGAGGTGGTGGTTTCGAATGGACCCCGGGT 

" 1 TT cTCGGAGAGGGACAGAGGCCCATTTCCACCTCCACCACCAAAGCTTACCTGGGGCCCA 

KSLSLSP. GKGGGGGFEWTPG 

BamHX 
I 

TACTGGCAGCCGTACGCTCTGCCGCTGTAATGGATCCCTCGAG 

721 + + + + " 763 

ATGACCGTCGGCATGCGAGACGGCGACATTACCTAGGGAGCTC 

YWQPYALPL* 
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FIG. 22A 



Ndel 
I 

CATATGTTCGAATGGACCCCGGGTTACTGGCAGCCGTACGCTCTGCCGCTGGGTGGAGGC 

1 ♦ + ♦ + + + 60 

GTATACAAGCTTACCTGGGGCCCAATGACCGTCGGCATGCGAGACGGCGACCCACCTCCG 

MFEWT PGYWQPYALPLGGG 

GGTGGGGACAAAACTCACACATGTCCACCTTGCCCAGCACCTGAACTCCTGGGGGGACCG 

61 * + + + ♦ + 120 

CCACCCCTGTTTTGAGTGTGTACAGGTGGAACGGGTCGTGGACTTGAGGACCCCCCTGGC 

GGDKTHTCPPCPAPELLGGP 

TCAGTTTTCCTCTTCCCCCCAAAACCCAAGGACACCCTCATGATCTCCCGGACCCCTGAG 

121 +• - - + + ♦ + + 180 

AGTCAAAAGGAGAAGGGGGGTTTTGGGTTCCTGTGGGAGTACTAGAGGGCCTGGGGACTC 

SVFLFPPKPKDTLMISRTPE 

GTCACATGCGTGGTGGTGGACGTGAGCCACGAAGACCCTGAGGTCAAGTTCAACTGGTAC 

181 + + ♦ + + ♦ 240 

CAGTGTACGCACCACCACCTGCACTCGGTGCTTCTGGGACTCCAGTTCAAGTTGACCATG 

VTCVVVDVSHEDPEV KFNWY 

GTGGACGGCGTGGAGGTGCATAATGCCAAGACAAAGCCGCGGGAGGAGCAGTACAACAGC 

241 + + + + + + 300 

CACCTGCCGCACCTCCACGTATTACGGTTCTGTTTCGGCGCCCTCCTCGTCATGTTGTCG 

VDGVEVHNAKTKPREEQYNS 

ACGTACCGTGTGGTCAGCGTCCTCACCGTCCTGCACCAGGACTGGCTGAATGGCAAGGAG 

301 + + + + + + 360 

TGCATGGCACACCAGTCGCAGGAGTGGCAGGACGTGGTCCTGACCGACTTACCGTTCCTC 

TYRVVSVLTVLHQDWLNGKE 

TACAAGTGCAAGGTCTCCAACAAAGCCCTCCCAGCCCCCATCGAGAAAACCATCTCCAAA 

361 + + .....♦.---..--.«...---.--. + 420 

ATGTTCACGTTCCAGAGGTTGTTTCGGGAGGGTCGGGGGTAGCTCTTTTGGTAGAGGTTT 

YKCKVSNKALPAPIEKTISK 

GCCAAAGGGCAGCCCCGAGAACCACAGGTGTACACCCTGCCCCCATCCCGGGATGAGCTG 

421 + + + + + + 480 

CGGTTTCCCGTCGGGGCTCTTGGTGTCCACATGTGGGACGGGGGTAGGGCCCTACTCGAC 

AKGQPREPQVYTLPPSRDEL 

ACCAAGAACCAGGTCAGCCTGACCTGCCTGGTCAAAGGCTTCTATCCCAGCGACATCGCC 

481 ♦ + ... + + 540 

TGGTTCTTGGTCCAGTCGGACTGGACGGACCAGTTTCCGAAGATAGGGTCGCTGTAGCGG 

TKNQVS LTCLVKGFYPSDIA 

GTGGAGTGGGAGAGCAATGGGCAGCCGGAGAACAACTACAAGACCACGCCTCCCGTGCTG 

54! + ♦ + + + + 600 

CACCTCACCCTCTCGTTACCCGTCGGCCTCTTGTTGATGTTCTGGTGCGGAGGGCACGAC 

VEWESNGQPENNYKTTPPVL 
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FIG. 22B 



GACTCCGACGGCTCCTTCTTCCTCTACAGCAAGCTCACCGTGGACAAGAGCAGGTGGCAG 

601 + + + + ♦ + 660 

CTGAGGCTGCCGAGGAAGAAGGAGATGTCGTTCGAGTGGCACCTGTTCTCGTCCACCGTC 

DSDGSFFLYSKLTVDKSRWQ 

CAGGGGAACGTCTTCTCATGCTCCGTGATGCATGAGGCTCTGCACAACCACTACACGCAG 

661 + + + ♦ + + 720 

GTCCCCTTGCAGAAGAGTACGAGGCACTACGTACTCCGAGACGTGTTGGTGATGTGCGTC 

QGNVFSCSVMHEALHNHYTQ 

BamHI 

I 

AAGAGCCTCTCCCTGTCTCCGGGTAAATAATGGATCC 

721 + + + 757 

TTCTCGGAGAGGGACAGAGGCCCATTTATTACCTAGG 

KSLSLSPGK* 
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FIG. 23A 

Ndel 



I 

CATATGGACAAAACTCACACATGTCCACCGTGCCCAGCACCTGAACTCCTGGGGGGACCG 

i + * ♦ + + 60 

GTATACCTGTTTTGAGTGTGTACAGGTGGCACGGGTCGTGGACTTGAGGACCCCCCTGGC 

MDKTHTCPPCPAPELLGGP 

TCAGTTTTCCTCTTCCCCCCAAAACCCAAGGACACCCTCATGATCTCCCGGACCCCTGAG 

61 * + + ♦ + + 120 

AGTCAAAAGGAGAAGGGGGGTTTTGGGTTCCTGTGGGAGTACTAGAGGGCCTGGGGACTC 

SVFLFPPKPKDTLMI SRTPE. 

GTCACATGCGTGGTGGTGGACGTGAGCCACGAAGACCCTGAGGTCAAGTTCAACTGGTAC 

121- + + ♦ + + + 180 

CAGTGTACGCACCACCACCTGCACTCGGTGCTTCTGGGACTCCAGTTCAAGTTGACCATG 

VTCVVVDVSHED PEVKFNWY 

GTGGACGGCGTGGAGGTGCATAATGCCAAGACAAAGCCGCGGGAGGAGCAGTACAACAGC 

181 + + --•• + + + + 240 

CACCTGCCGCACCTCCACGTATTACGGTTCTGTTTCGGCGCCCTCCTCGTCATGTTGTCG 

VDGVEVHNAKTKP. REEQYNS 

ACGTACCGTGTGGTCAGCGTCCTCACCGTCCTGCACCAGGACTGGCTGAATGGCAAGGAG 

241 + + + + + + 300 

TGCATGGCACACCAGTCGCAGGAGTGGCAGGACGTGGTCCTGACCGACTTACCGTTCCTC 

TYRVVSVLTVLHQDWLN GKE 

TACAAGTGCAAGGTCTCCAACAAAGCCCTCCCAGCCCCCATCGAGAAAACCATCTCCAAA 

301 + + + * + + 360 

ATGTTCACGTTCCAGAGGTTGTTTCGGGAGGGTCGGGGGTAGCTCTTTTGGTAGAGGTTT 

YKCKVSNKALPAPIEKTISK 

GCCAAAGGGCAGCCCCGAGAACCACAGGTGTACACCCtGCCCCCATCCCGGGATGAGCTG 

361 + ♦ + + + ♦ 420 

CGGTTTCCCGTCGGGGCTCTTGGTGTCCACATGTGGGACGGGGGTAGGGCCCTACTCGAC 

AKGQPREPQVYTLPPSRDEL 

ACCAAGAACCAGGTCAGCCTGACCTGCCTGGTCAAAGGCTTCTATCCCAGCGACATCGCC 

421 + + + + + • - 480 

TGGTTCTTGGTCCAGTCGGACTGGACGGACCAGTTTCCGAAGATAGGGTCGCTGTAGCGG 

TKNQVSLTCLVKGFYPSDIA 

GTGGAGTGGGAGAGCAATGGGCAGCCGGAGAACAACTACAAGACCACGCCTCCCGTGCTG 

481 --- + + + ♦ — - + -_ + 540 

CACCTCACCCTCTCGTTACCCGTCGGCCTCTTGTTGATGTTCTGGTGCGGAGGGCACGAC 

VEWESNGQPENNYKTTPPVL 

GACTCCGACGGCTCCTTCTTCCTCTACAGCAAGCTCACCGTGGACAAGAGCAGGTGGCAG 

541 + + + + + ♦ 600 

CTGAGGCTGCCGAGGAAGAAGGAGATGTCGTTCGAGTGGCACCTGTTCTCGTCCACCGTC 



DSDGSFFLYSKLTVDKSRWQ 
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FIG. 23B 



CAGGGGAACGTCTTCTCATGCTCCGTGATGCATGAGGCTCTGCACAACCACTACACGCAG 

601 + + + + + + 660 

GTCCCCTTGCAGAAGAGTACGAGGCACTACGTACTCCGAGACGTGTTGGTGATGTGCGTC 

QGNVFSCSVMHEALHNHYTQ 

AAGAGCCTCTCCCTGTCTCCGGGTAAAGGTGGTGGTGGTGGTGTTGAACCGAACTGTGAC 

661 + + + + + + 720 

TTCTCGGAGAGGGACAGAGGCCCATTTCCACCACCACCACCACAACTTGGCTTGACACTG 

KSLSLSPGKGGGGGVEPNCD 

BamHI 
I 

ATCCATGTTATGTGGGAATGGGAATGTTTTGAACGTCTGTAACTCGAGGATCC 

721 ♦ + + ♦ + --- 773 

TAGGTACAATACACCCTTACCCTTACAAAACTTGCAGACATTGAGCTCCTAGG 

I HVMWEWECFERL* 
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FIG. 24A 



Ndel 



CATATGGTTGAACCGAACTGTGACATCCATGTTATGTGGGAATGGGAATGTTTTGAACGT 

GTATACCAACTTGGCTTGACACTGTAGGTACAATACACCCTTACCCTTACAAAACTTGCA 

MVEPNCDIHVMWEWBCFER 

CTGGGTGGTGGTGGTGGTGACAAAACTCACACATGTCCACCGTGCCCAGCACCTGAACTC 

61 + + + - + ♦ + 120 

GACCCACCACCACCACCACTGTTTTGAGTGTGTACAGGTGGCACGGGTCGTGGACTTGAG 

LGGGGGD KTHTCPPCPAPEL 

CTGGGGGGACCGTCAGTTTTCCTCTTCCCCCCAAAACCCAAGGACACCCTCATGATCTCC 

121 ♦ + + + + + 180 

GACCCCCCTGGCAGTCAAAAGGAGAAGGGGGGTTTTGGGTTCCTGTGGGAGTACTAGAGG 

LGGPSVFLFPPKPKDTLMIS 

CGGACCCCTGAGGTCACATGCGTGGTGGTGGACGTGAGCCACGAAGACCCTGAGGTCAAG 
181 + + -..- + + + + 240 

GCCTGGGGACTCCAGTGTACGCACCACCACCTGCACTCGGTGCTTCTGGGACTCCAGTTC 

RTPEVTCVVVDVSHEDPEVK 

TTCAACTGGTACGTGGACGGCGTGGAGGTGCATAATGCCAAGACAAAGCCGCGGGAGGAG 

241 + + + + + + 300 

AAGTTGACCATGCACCTGCCGCACCTCCACGTATTACGGTTCTGTTTCGGCGCCCTCCTC 

FNWYVDGVEVHNAKTKPREE 

CAGTACAACAGCACGTACCGTGTGGTCAGCGTCCTCACCGTCCTGCACCAGGACTGGCTG 

301 + + + + + + 360 

GTCATGTTGTCGTGCATGGCACACCAGTCGCAGGAGTGGCAGGACGTGGTCCTGACCGAC 

QYNSTYRVVSVLTVLHQDWL 

AATGGCAAGGAGTACAAGTGCAAGGTCTCCAACAAAGCCCTCCCAGCCCCCATCGAGAAA 

361 + + + + + + 420 

TTACCGTTCCTCATGTTCACGTTCCAGAGGTTGTTTCGGGAGGGTCGGGGGTAGCTCTTT 

NGKEYKCKVSNKALPAPIEK 

ACCATCTCCAAAGCCAAAGGGCAGCCCCGAGAACCACAGGTGTACACCCTGCCCCCATCC 

421 + + + ♦ + + 480 

TGGTAGAGGTTTCGGTTTCCCGTCGGGGCTCTTGGTGTCCACATGTGGGACGGGGGTAGG 

TISKAKGQPREPQVYTLPPS 

CGGGATGAGCTGACCAAGAACCAGGTCAGCCTGACCTGCCTGGTCAAAGGCTTCTATCCC 

481 + + + + + : + '540 

GCCCTACTCGACTGGTTCTTGGTCCAGTCGGACTGGACGGACCAGTTTCCGAAGATAGGG 

RDELTKNQVSLTCLVKGFYP 

AGCGACATCGCCGTGGAGTGGGAGAGCAATGGGCAGCCGGAGAACAACTACAAGACCACG 

541 + + ♦ + + + 600 

TCGCTGTAGCGGCACCTCACCCTCTCGTTACCCGTCGGCCTCTTGTTGATGTTCTGGTGC 

SDIAVEWESNGQPENNYKTT 
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FIG. 24B 



CCTCCCGTGCTGGACTCCGACGGCTCCTTCTTCCTCTACAGCAAGCTCACCGTGGACAAG 
601 + ♦ + + + + 660 

GGAGGGCACGACCTGAGGCTGCCGAGGAAGAAGGAGATGTCGTTCGAGTGGCACCTGTTC 

PPVLDSDGSFFLYSK LTVDK 

AGCAGGTGGCAGCAGGGGAACGTCTTCTCATGCTCCGTGATGCATGAGGCTCTGCACAAC 
661 + + + + + + 720 

TCGTCCACCGTCGTCCCCTTGCAGAAGAGTACGAGGCACTACGTACTCCGAGACGTGTTG 

S RWQQGNVFSCSVMHEALHN 

BamHZ 
I 

CACTACACGCAGAAGAGCCTCTCCCTGTCTCCGGGTAAATAACTCGAGGATCC 
721 + ♦ + + + 773 

GTGATGTGCGTCTTCTCGGAGAGGGACAGAGGCCCATTTATTGAGCTCCTAGG 
HYTQKSLSLSPGK* 
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FIG. 25A 

Ndel 



I 

CATATGGACAAAACTCACACATGTCCACCTTGTCCAGCTCCGGAACTCCTGGGGGGACCG 

1 * + * + + ♦ 60 

GTATACCTGTTTTGAGTGTGTACAGGTGGAACAGGTCGAGGCCTTGAGGACCCCCCTGGC 

MDKTHTCPPCPAPELLGGP 

TCAGTCTTCCTCTTCCCCCCAAAACCCAAGGACACCCTCATGATCTCCCGGACCCCTGAG 

61 * * + * + ♦ 120 

AGTCAGAAGGAGAAGGGGGGTTTTGGGTTCCTGTGGGAGTACTAGAGGGCCTGGGGACTC 

SVFLFPPKPKDTLMISRTPE 

GTCACATGCGTGGTGGTGGACGTGAGCCACGAAGACCCTGAGGTCAAGTTCAACTGGTAC 

121 + + + + + + 180 

CAGTGTACGCACCACCACCTGCACTCGGTGCTTCTGGGACTCCAGTTCAAGTTGACCATG 

VTCVVVDVSHEDPEVKFNWY 

GTGGACGGCGTGGAGGTGCATAATGCCAAGACAAAGCCGCGGGAGGAGCAGTACAACAGC 

181 + + + + + + 240 

CACCTGCCGCACCTCCACGTATTACGGTTCTGTTTCGGCGCCCTCCTCGTCAT6TTGTCG 

VDGVEVHNAKTKPREEQYNS 

ACGTACCGTGTGGTCAGCGTCCTCACCGTCCTGCACCAGGACTGGCTGAATGGCAAGGAG 

241 * + ♦ ♦ + + 300 

TGCATGGCACACCAGTCGCAGGAGTGGCAGGACGTGGTCCTGACCGACTTACCGTTCCTC 

TYRVVSVLTVLHQD WLNGKE 

TACAAGTGCAAGGTCTCCAACAAAGCCCTCCCAGCCCCCATCGAGAAAACCATCTCCAAA 

301 + + + + + + 360 

ATGTTCACGTTCCAGAGGTTGTTTCGGGAGGGTCGGGGGTAGCTCTTTTGGTAGAGGTTT 

YKCKVSNKALPAPIEKTISK 

GCCAAAGGGCAGCCCCGAGAACCACAGGTGTACACCCTGCCCCCATCCCGGGATGAGCTG 

361 + * + + ♦ + 420 

CGGTTTCCCGTCGGGGCTCTTGGTGTCCACATGTGGGACGGGGGTAGGGCCCTACTCGAC 

AKGQPREPQVYTLPPSRDEL 

ACCAAGAACCAGGTCAGCCTGACCTGCCTGGTCAAAGGCTTCTATCCCAGCGACATCGCC 

421 + + + + + + 480 

TGGTTCTTGGTCCAGTCGGACTGGACGGACCAGTTTCCGAAGATAGGGTCGCTGTAGCGG 

TKNQVSLTCLVKGFYPSDIA 

GTGGAGTGGGAGAGCAATGGGCAGCCGGAGAACAACTACAAGACCACGCCTCCCGTGCTG 

481 --■ + + - -. + + + + 540 

CACCTCACCCTCTCGTTACCCGTCGGCCTCTTGTTGATGTTCTGGTGCGGAGGGCACGAC 

VEWESNGQPENNYKTTPPVL 

GACTCCGACGGCTCCTTCTTCCTCTACAGCAAGCTCACCGTGGACAAGAGCAGGTGGCAG 

541 + ♦ + + ♦ ♦ 600 

CTGAGGCTGCCGAGGAAGAAGGAGATGTCGTTCGAGTGGCACCTGTTCTCGTCCACCGTC 



DSDGSFFLYSKLTVDKSRWQ 
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CAGGGGAACGTCTTCTCATGCTCCGTGATGCATGAGGCTCTGCACAACCACTACACGCAG 

601 + *" + + + + 660 

GTCCCCTTGCAGAAGAGTACGAGGCACTACGTACTCCGAGACGTGTTGGTGATGTGCGTC 

QGNVFSCSVMHEALHNHYTQ 

AAGAGCCTCTCCCTGTCTCCGGGTAAAGGTGGAGGTGGTGGTTGCACCACCCACTGGGGT 

661 + + - - - + + + ♦ 720 

TTCTCGGAGAGGGACAGAGGCCCATTTCCACCTCCACCACCAACGTGGTGGGTGACCCCA 

K S LS LS PGK GGGGGCTTHWG 

BamHI 
I 

TTCACCCTGTGCTAATGGATCCCTCGAG 

721 + + 748 

AAGTGGGACACGATTACCTAGGGAGCTC 

F T L C * 
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FIG. 26A 



Ndel 

CATATGTGCACCACCCACTGGGGTTTCACCCTGTGCGGTGGAGGCGGTGGGGACAAAGGT 
GTATACACGTGGTGGGTGACCCCAAAGTGGGACACGCCACCTCCGCCACCCCTGTTTCCA 
MCTTHWGFTLCGGGGGDKG 

GGAGGCGGTGGGGACAAAACTCACACATGTCCACCTTGCCCAGCACCTGAACTCCTGGGG 

gi ♦ + ♦ ♦ + **" 120 

CCTCCGCCACCCCTGTTTTGAGTGTGTACAGGTGGAACGGGTCGTGGACTTGAGGACCCC 



G G 



GGDKTHTCPPCPAPELLG 



GGACCGTCAGTTTTCCTCTTCCCCCCAAAACCCAAGGACACCCTCATGATCTCCCGGACC ^ ^ 
121 CCTGGCAGTCAAAAGGAGAAGGGGGGTTTTGGGTTCCTGTGGGAGT^ 

GPSVFLFPPKPKDTLMISRT 

CCTGAGGTCAC ATGCGTGGTGGTGGACGTGAGCC ACGAAGACCCTGAGGTCAAGTTCAAC ^ ^ 

1 8 1 GGACTCCAGTGTACGCACCACCACCTGCACTC 

PEVTCVVVDVSHEDPEVKFN 

TGGTACGTGGACGGCGTGGAGGTGCATAATGCCAAGACAAAGCCGCGGGAGGAGC AGTAC 

241 + + + + «■ + 300 

ACCATGCACCTGCCGCACCTCCACGTATTACGGTTCTGTTTCGGCGCCCTCCTCGTCATG 

WYVDGVEVHNAKTKPREEQY 

AACAGC ACGTACCGTGTGGTCAGCGTCCTC ACCGTCCTGC ACCAGGACTGGCTGAATGGC 

301 TTGTCGT^ATGGCACACCAGTCGCAGGAGTGGCAGGACGTGGTCCTGACCGACTTACCG 

NSTYRVVSVLTVLHQDWLNG 

AAGGAGTACAAGTGCAAGGTCTCCAACAAAGCCCTCCCAGCCCCCATCGAGAAAA ^ ^ q 

3 6 1 TTCCTCATGTTCACG^CCAGAGGTTGTTTCGGGAGGGTCGGGGGTAGCTCT^ 

KEYKCKVSNKALPAPIEKTI 

TCCAAAGCCAAAGGGCAGCCCCGAGAACCACAGGTGTACACCCTGCCCCCATCCCGGGAT 

d91 + + + + + + 48U 

AGGTTTCGGTTTCCCGTCGGGGCTCTTGGTGTCCACATGTGGGACGGGGGTAGGGCCCTA 

SKAKGQPREPQVYTLPPSRD • 

GAGCTGACCAAGAACCAGGTCAGCCTGACCTGCCTGGTCA^GGCTCCTATCCCAGCGAC ^ ^ ^ 

4 8 1 CTCGACTGGTTCTTGGTCCAGTCGGACTOSACGGACCAGTTTCCGAAGATAGGGTCGCTG 

ELTKNQVSLTCLVKGFYPSD • 

ATCGCCGTGGAGTGGGAGAGCAATGGGCAGCCGGAGAACAACTACAAGACCACGCCTCCC ^ ^ ^ 
541 TAGCGGCACCTCACCCTCTCGTTACCCGTCGGCCTCTTGTTGATGTTCTGGTG^ 

IAVBWESNGQPENNYKTTPP - 
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FIG.26B 

GTGCTGGACTCCGACGGCTCCTTCTTCCTCTACAGCAAGCTCACCGTGGACAAGAGCAGG 

601 + + ♦ + ♦ + 660 

CACGACCTGAGGCTGCCGAGGAAGAAGGAGATGTCGTTCGAGTGGCACCTGTTCTCGTCC 

a V LDSDGS FFLYSKLTVDKSR 

TGGCAGCAGGGGAACGTCTTCTCATGCTCCGTGATGCATGAGGCTCTGCACAACCACTAC 

661 + + ♦ + ♦ + 720 

ACCGTCGTCCCCTTGCAGAAGAGTACGAGGCACTACGTACTCCGAGACGTGTTGGTGATG 

a WQQGNVFSCSVMHEALHNHY 

BamHI 
I 

ACGCAGAAGAGCCTCTCCCTGTCTCCGGGTAAATAATGGATCC 

721 + ♦ + + - 763 

TGCGTCTTCTCGGAGAGGGACAGAGGCCCATTTATTACCTAGG 

a TQKSLSLSPG K* 
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